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HEPATITIS C VIRUS SUB-GENOMIC REPLICONS 

TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to the construction of sub-genornic HCV 
replicon systems that may provide the foundation for generating HCV replicons of all six 
major genotypes and subtypes to facilitate screening, testing, and evaluating anti-infective 
agents for HCV disease(s). 

BACKGROUND OF THE INVENTION 

In the U.S., an estimated 4.5 million Americans are chronically infected with hepatitis 
C virus (HCV). Although only 30% of acute infections are symptomatic, greater than 85% 
of infected individuals develop chronic, persistent infection. Treatment costs for HCV 
infection have been estimated at $5.46 billion for the U.S. in 1997. Worldwide, over 200 
million people are estimated to be infected chronically. HCV infection is responsible for 
40-60% of all chronic liver disease and 30% of all liver transplants. The CDC estimates that 
the number of deaths due to HCV will minimally increase to 38,000/yr. by the year 2010. 

Due to the high degree of variability in the viral surface antigens, existence of 
multiple viral genotypes, and demonstrated specificity of immunity, the development of a 
successful vaccine in the near future is unlikely. Alpha-interferon, alone or in combination 
with ribavirin, has been widely used since its approval for treatment of chronic HCV 
infection. However, adverse side effects are commonly associated with this treatment: flu- 
like symptoms, leukopenia, thrombocytopenia, and depression from interferon, as well as 
hemolytic anemia induced by ribavirin (Lindsay, 1997). This therapy remains less effective 
against infections caused by HCV genotype 1, which constitutes -75% of all HCV 
infections in the developed markets, compared to infections caused by the other five major 
HCV genotypes. Unfortunately, only -50-80% of patients respond to this treatment, 
measured by a reduction in serum HCV RNA levels and normalization of liver enzymes. Of 
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those patients treated, 50-70% relapse within six months of cessation of treatment. Recently 
with the introduction of pegylated interferon, both initial and sustained response rates have 
improved substantially, and combination treatment of Peg-IFN with ribavirin constitutes the 
gold standard for therapy. However, side effects associated with the combination therapy 
and the impaired response in patients with genotype 1 present opportunities for 
improvement in disease management. 

First identified by molecular cloning in 1989 (Choo et aL, 1989), HCV is now widely 
accepted as the most common causative agent of post-transfusion non-A, non-B hepatitis 
(NANBH) (Kuo et al. y 1989). Due to HCV's genome structure and sequence homology, this 
virus was assigned as a new genus in the Flaviviridae family. Like the other members of 
the Flaviviridae, such as flaviviruses (e.g., yellow fever virus and Dengue virus types 1-4) 
and pesti viruses (e.g., bovine viral diarrhea virus, border disease virus, and classic swine 
fever virus (Choo et al y 1989; Miller and Purcell, 1990)), HCV is an enveloped virus 
containing a single strand RNA molecule of positive polarity. The HCV genome is 
approximately 9.6 kilobases (kb) with a long, highly conserved, non-capped 5' non- 
translated region (NTR) of approximately 340 bases which functions as an internal ribosome 
entry site (IRES) (Wang and Siddiqui, 1995). This element is followed by a region which 
encodes a single long open reading frame (ORF) encoding a polypeptide of -3000 amino 
acids comprising both the structural and nonstructural viral proteins. 

Upon entry of the RNA into the cytoplasm of the cell, it is directly translated into a 
polypeptide of -3000 amino acids comprising both the structural and nonstructural viral 
proteins. This large polypeptide is subsequently processed into the individual structural and 
nonstructural proteins by a combination of host and virally-encoded proteinases 
(Kolykhalov et aL, 1996). Following the termination codon at the end of the long ORF, 
there is a 3* NTR which roughly consists of three regions: an ~ 40 base region which is 
poorly conserved among various genotypes, a variable length poly (U)/polypyrimidine tract, 
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and a highly conserved 98 base element also called the M 3'X-tail" (Kolykhalov et al y 1996; 
Tanaka et ai, 1995; Tanaka et a/., 1996; Yamada et al y 1996). The 3*NTR is predicted to 
form a stable secondary structure that is essential for HCV growth in chimps and is believed 
to function in the initiation and regulation of viral RNA replication. 

To study the biology of HCV and to screen for compounds that inhibit the virus 
replication, a cell-based system to grow the virus is essential. Recently, cell-based replicon 
systems for HCV were developed, in which the nonstructural proteins stably replicate sub- 
genomic viral RNA in Huh-7 cells (Lohmann et al., Science 285:1 10 (1999) and Blight et 
al., Science 290:1972 (2000)). In the absence of a purified, functional HCV replicase 
consisting of viral non-structural and host proteins, our understanding of Flaviviridae RNA 
synthesis comes from studies using active recombinant RNA-dependent RNA polymerase 
(RdRp) and validation of these studies in the HCV replicon system. Novel chemical 
entities, which interfere with HCV RNA synthesis, are identified by screening compound 
banks for inhibition of recombinant HCV polymerase in biochemical assays. However, 
biochemical inhibition against this purified enzyme does not necessarily translate into 
replicon cell-based inhibition since in the latter system the polymerase exists within a 
replicase complex, associated with other viral and cellular polypeptides in appropriate 
stoichiometry. Moreover, in the absence of a robust small animal model for HCV infection, 
the replicon system more accurately represents an active infection than other in vitro 



However, despite the existence of infectious cDNA clones and many attempts to 
cultivate the virus in established cell lines in laboratories, efficient in vitro replication of the 
HCV virus has not been established prior to the present invention. A sub-genomic HCV 
replication system was successful in the human hepatoma cell line Huh-7 after 
electroporation of a particular genotype lb, BB7 strain, RNA replicon containing a neo 
resistance marker (Lohmann et al, Science 285: 1 10, 1 999). The replicon is a di-cistronic 
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construct containing the 5' untranslated region of HCV (5' NTR), a neomycin resistance 
gene (neo), the non-structural genes NS3 to NS5 and the 3' non-translated region of HCV (3' 
NTR). Translation of neo and NS3 to NS5 genes is mediated by HCV 5' NTR and EMCV 
IRES, respectively. Improved replicons, containing high-levels of viral sub-genomic RNA 
copies were obtained by selection for adapted mutations in Huh-7 cells (Blight, K. J. et al. 
2000; Lohmann et al. 2001; Guo et al. 2001). 

Huh-7 cell-based replicon systems for HCV were constructed so that the non-structural 
proteins replicate sub-genomic viral RNA. One significant limitation of the available 
replicon systems is the inability of other genotypic derivatives, beyond that of two specific 
strains of genotype lb (HCV-N and HCV-BB7), to replicate in Huh-7 cells. The generation 
of functional replicons for HCV genotypes 1 to 6 would be invaluable in efficiently 
developing antiviral agents, and solves a longstanding problem. 

SUMMARY OF THE INVENTION 

In view of the aforementioned deficiencies associated with prior art HCV replicons, 
cell culture systems for the analysis of HCV replication, and the development of therapeutic 
compositions therefor, it is evident that there exists a need in the art for identification of 
chimeric sub-genomic replicons of HCV which can be incorporated into a cell capable of 
replicating RNA transcripts. These transcripts can then be used as target sequences for the 
production of attenuated HCV for vaccines, and targets for therapeutic compositions. 

In accordance with the present invention, nucleotide sequences derived from various 
functional chimeric HCV replicons are provided herein. Through molecular cloning and 
tissue culture technologies as well as detailed analysis of the literature, the present invention 
describes the successful generation of stable cell lines expressing and replicating functional 
replicons, containing sequences from HCV genotype la (strain H77) or genotype lb (strain 
J4) within the prototype lb replicon backbone from HCV strain BB7. The methodology 
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used to create these functional replicons described herein which express (1) genotype la 
sequences or (2) genotype lb sequences from other strains not previously shown to be 
functional in such systems provides the foundation and know-how for generating HCV 
replicons of all six major genotypes and subtypes to facilitate screening, testing, and 
evaluating anti-infective agents for HCV disease(s). 

One embodiment of the invention is a sub-genomic viral replicon that contains a nucleic 
acid construct encoding chimeric HCV nonstructural proteins and a complete NS5B 
polymerase. A further embodiment provides an NS5B encoding sequence linked in cis to a 
3TJTR sequence of an HCV strain, preferably the same HCV strain. 

In another embodiment, a chimeric replicon includes the nonstructural proteins NS3, 
NS4A, NS4B, NS5A, and NS5B. In a further embodiment of the invention, a chimeric 
replicon comprises an NS3 nucleotide sequence that encodes about the first 75 contiguous 
N-terminal amino acids of NS3 of genotype lb. A preferred embodiment of the invention is 
a chimeric replicon that contains the NS3 encoding nucleotide sequence of a genotype lb, 
BB7 strain. The nucleotide sequence that encodes the first 75 contiguous N-terminal amino 
acids of HCV type lb, strain BB7 is 

ATGGCGCCTATTACGGCCTACTCCCAACAGACGCGAGGCCTACTTGGCTGCATCATCACTAGCCTCAC 
AGGCCGGGACAGGAACCAGGTCGAGGGGGAGGTCCAAGTGGTCTCCACCGCAACACAATCTTTCCTGG 
CGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATGGTGCCGGCTCAAAGACCCTTGCCGGCCCA 
AAGGGCCCAATCACCCAAATG ( SEQ ID NO : 1 ) . 

In still another embodiment of the invention, a chimeric replicon comprises an NS3 
from any of the other six major HCV genotypes and subtypes wherein a particular NS3 has 
its N-terminal first 225 nucleotides replaced by the N-terminal first 225 nucleotides of NS3 
of the genotype lb, BB7 strain. For example, a preferred embodiment of the invention 
comprises a replicon wherein the N-terminal sequence of NS3 genotype lb, BB7 strain, 
replaces the corresponding N-terminal sequence of NS3, genotype la. In addition, another 
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embodiment is provided wherein a NS3 sequence is from the genotype la, H77 strain. In 
yet a further embodiment, an NS3 sequence is from the genotype lb, J4 strain. 

Another embodiment of the invention is a sub-genomic viral replicon that comprises a 
nucleic acid construct encoding chimeric HCV nonstructural proteins, and at least the C- 
terminal end of a strain specific NS5B polymerase gene linked in cis to a 3UTR sequence 
from said strain. Specifically, a preferred embodiment of the invention is a chimeric 
replicon wherein the NS5B comprises sequence from both a BB7 strain and a J4 strain, see 
Figure 2(b), the J4M/S construct. The C-terminal portion of BB7's NS5B is linked in cis to 
a 3TJTR sequence of a BB7 strain. However, the N-terminal portion of NS5B is sequence 
from the J4 strain. In an embodiment of the invention, there is provided a construct wherein 
less than the complete NS5B from a single HCV strain is used because both BB7 and J4 are 
of the same subtype, lb. 

In another embodiment of the invention, a replicating HCV sub-genomic replicon 
comprises SEQ ID NO:2, SEQ ID NO:6, SEQ ID NO:7, or SEQ ID NO:8. 

A further embodiment of the invention includes a method for introducing a HCV 
replicon into a cell. The method produces a cell comprising a replicating chimeric HCV 
sub-genomic viral replicon. A preferred embodiment a cell of the invention comprising a 
HCV sub-genomic replicon further comprising all of the non-structural HCV genes and 
none of the structural HCV genes. 

A further embodiment of the invention is a method of screening for compounds that 
modulate viral replication comprising the steps of administering a test compound to a cell 
comprising a replicating chimeric HCV sub-genomic viral replicon and determining whether 
said test compound modulates replication of said sub-genomic replicon. 

Another embodiment of the invention includes a method of screening for compounds 
that inhibit viral replication comprising the steps of administering a test compound to a cell 
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comprising a replicating chimeric HCV sub-genomic viral replicon and determining whether 
said test compound inhibits replication of said chimeric sub-genomic viral replicon. 

BREIF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a comparative sequence alignment of 3TJTR coding region from 
genotype la and lb strains. 

Figure 2 schematically depicts the novel replicon constructs in comparison with the 
prototype pHCVreplb(BB7). Replicon pHCVreplb(BB7) contains noncoding sequences in 
thin lines and protein coding sequences in boxes (neo=G418 resistance gene; NS3 to NS5B 
= nonstructrual protein genes). The arrow on top of the figure shows the translation start 
codon AUG and the NS5A adaptive mutation Serine 1 179 Isoleucine. The arrows below the 
Figure indicate the relevant restriction sites in the DNA plasmid used for sub-cloning. B, 
Bsr GI; RI, Eco RI; M, Mfe I. Stripped boxes indicate DNA sequences derived from pCVJ4, 
Shaded boxes indicate sequences derived from pCVH77. 5* and 3' NTR sequences are 
indicated while the nucleotide changes in 3NTR are pointed out by the nucleotide under the 
3 'NTR. Replication capability was indicated by + and - signs at the left of the figure. 

Figure 3 depicts Huh-7 cell colony formation. Colonies formed after selection in G418 
containing media for over four weeks were stained with 5% crystal violet in methanol and 
photographed. 

Figure 4 depicts a Western blot of stable replicon cells expressing NS5A. Total 
replicon cell protein was separated on 8% SDS-PAGE, transferred to nitrocellulose paper 
and probed with anti-NS5A serum. Replicon cell A serves as positive control, while Huh-7 
cell as negative control. The molecular weight is indicated. 

Figure 5 illustrates an analysis of replicon genomic DNA by PCR. Genomic DNA 
from each replicon cell line was purified and used as substrates in PCR reactions to amplify 
either GDAPH or Neo resistance gene. While in both H77 1 A (Fl) and J4 M/S cells, DNA 
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for GAPDH was amplified, indicating that the good quality of genomic DNA, no Neo DNA 
was detected. For positive control of the reaction, amplification of neo DNA was from a 
neo plasmid DNA. 

Figure 6 illustrates a genome copy number determination by TaqMan analysis. 

DETAILED DESCRIPTION OF THE INVENTION 

Through molecular cloning and tissue culture technologies, the present invention 
produced functionally stable cell lines expressing and replicating replicon RNA consisting 
of sequences from HCV genotype la, strain H77, within the prototype genotype lb replicon, 
strain BB7, backbone. The type la replicon system is characterized in detail and discussed 
herein. 

Specifically, expression of HCV viral proteins, NS3, NS5A, and NS5B, were detected 
by Western blot and in situ immunofluoresence analysis. Positive- and negative-strand 
replicon RNA was quantified by TaqMan and full length RNA was confirmed by northern 
blot analysis. Susceptibility to interferon and antiviral agents was shown to be similar when 
comparing with the type lb replicon system. Lastly, cDNA was generated from replicon 
cell RNA by RT-PCR, and as many as 18 clones were sequenced to identify adaptive 
mutation(s) not present in the parental H77 sequence. The non-structural genes NS3, NS5A, 
and NS5B contained four mutations. While the two mutations in NS3 were found in all 
clones, the mutations in NS5A and NS5B were always present in a same subset of clones. 

Huh-7 cell-based replicon systems for HCV were developed, in which the nonstructural 
proteins stably replicate sub-genomic viral RNA. (Lohmann etal. y Science 285:1 10 (1999) 
and Blight et al. y Science 290:1972 (2000)). Although two specific strains of genotype lb, 
(HCV-N and HCV-BB7), have been confirmed to be functional in Huh-7 cells, it is not clear 
why other strains of genotype lb are unable to replicate. The present invention is based, in 
part, on understanding the interplay of how certain adaptive mutations confer competence in 
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Huh-7 cells for one viral strain, yet altogether different mutations are critical for a different 
viral strain would facilitate the creation of functional replicons for HCV genotypes 1 to 6. 

These functional chimeric replicons are valuable tools that enable one skilled in the art 
to adapt known replicon systems to other HCV genotypes, such as HCV type la strain H77 
(Yanagi, M. et. at. 1997) and HCV type lb strain HC-J4 (Yanagi, M. etal 1998), which 
have not been reported as successful. The reported cDNA for genotype lb, strain J4, was 
constructed using the 5* and 3' untranslated regions of a genotype la strain (Yanagi, M. et al. 
1998). The infectivity of three full-length cDNA clones was tested by direct injection of 
RNA transcripts into the liver of a chimpanzee and only one of the three clones was found 
to be infectious. The infectious clone coding region contained three amino acid changes 
from the parental J4 strain. Moreover, heterogeneity of the 3UTR was examined and 
several changes compared to the parental genotype la 3UTR were identified (nt 9407, 
9399, poly U-UC region varied in length and complexity, and several point mutations in the 
conserved region of the 3TJTR. Although not mentioned by these authors, it is provided by 
this invention that these in v/vo-selected changes indicate that the generation of functional 
HCV genomes requires a specific NS5B polymerase sequence (e.g., J4 strain polymerase 
sequence) to be linked in cis to its cognate 3TJTR sequences (e.g., J4 strain 3TJTR 
sequence). Replication of negative strand by NS5B initiates at the 3TJTR, and coordination 
of specific sequences in these two regions directs function. The present invention further 
provides generation of chimeric replicons containing genotype lb sequence from the J4 
strain that are nonfunctional when paired with 3UTR sequences from genotype la strains. 
The invention thus explains the lack of reported J4 strain replicons to date, but in no way is 
the invention limited by this hypothesis. 

The present invention provides certain sub-genomic replicon constructs. A replicon 
comprising both the polyprotein coding sequences and 3TJTR sequences from a genotype 
la, strain H77, cDNA is functional, based on the in cis pairing of NS5B and 3UTR from 
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same strain. Moreover, functional replicons are provided wherein all polyprotein coding 
sequences except NS5B were replaced by sequences of genotype lb, strain J4, while leaving 
NS5B and 3UTR sequences intact from genotype lb sequences reported in the BB7 lb 
functional replicon. Lastly, based on the 3UTR sequences in Figure 1, replicons containing 
solely J4 strain sequences function if the genotype la 3UTR from the infectious cDNA is 
replaced with the 3UTR from genotype lb, strain J4. Data confirming this result is 
provided elsewhere herein. 

In addition, the hybrid clones of the present invention retained a portion of the 5' 
sequence of the non-structural gene NS3, genotype lb, of the BB7 replicon. The first 225 
nucleotides of genotype lb replaced the corresponding 5' end of the non-structural gene NS3 
of either genotype la or genotype lb, J4 strain. 

The present invention discloses a panel of constructed BB7 strain chimeras, among 
others. These chimeras comprise sequences from the infectious strains J4, type lb or H77, 
type la. The J4/BB7 chimeric replicons comprising coding sequence for NS3, NS4A, 
NS4B, NS5A, and 132 amino acids of NS5B from J4 and the remainder of the replicon from 
BB7 efficiently replicated in Huh-7 cells. However, replacement of BB7 NS5B and 3NTR 
with J4 strain sequences failed to result in stable cell line generation, although transient 
replication was not ruled out. The H77/BB7 chimeric replicons containing NS3, NS4A, and 
NS4B sequences from H77 supported replication in Huh-7 cells, although replacement of 
BB7 NS5A and the N-terminal 132 amino acids of NS5B with H77 sequences rendered this 
replicon deficient. Lastly, replacing a portion of BB7 NS5B and 3NTR elements with H77 
sequences also did not confer stable replication in Huh-7 cells. Surprisingly, a chimeric 
replicon whereby the entire BB7 NS5B was replaced with H77 sequences resulted in 
efficient colony formation and stable replication in Huh-7 cells. Adaptive mutations were 
found in the H77 NS5B region of this replicon. Such replicons could be used for cell-based 
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testing of antiviral agents, among other uses, against NS3 and NS5B from strains J4 and 
H77. 

In the present invention, certain chimeric replicon constructs were generated from 
pHCVreplb(BB7) with sequences substituted with either type la HCV H77 sequence or by 
HCV type lb J4 strain sequence. Rather than trying to replace only the NS5B coding 
region, as other groups have attempted but failed to achieve, the present invention was 
based, in part, on the methodical replacement of sections of the pHCVreplb(BB7) 
nonstructural genes with la or J4 lb sequences. Replicon RNA prepared in vitro for the 
majority of these chimeras (BB7-F1/F2, BB7-F3, BB7-F3(C), J4B/R1, J4 B/Rl (C), J4 
replicon, J4 replicon (c), and BB7/J4NS5B, see Figure 2) failed to yield stable cell lines 
after transfection into Huh-7 cells. However, surprisingly, constructs BB7-F1, 
BB7/H77NS5B, HCV la replicon and J4 M/S were able to stably replicate and confer 
resistance to G418. The methods for constructing these replicons are described herein. 

Evaluation of the functional replicon constructs demonstrated certain findings that are, 
in certain embodiments, important for chimeric replicon generation of multiple genotypes 
and subtypes: (1) A portion of NS5B, starting at residue 133, from strain BB7 genotype lb 
can confer replication, albeit at low levels and (2) the 3NTR sequences from BB7 can 
confer replication of the replicon, albeit also at loow levels. Both the 3NTR and a portion 
of residues from NS5B are, in certain embodiments, important components which together 
control replication of the sub-genomic RNA in Huh-7 cells. This finding indicates that 
inclusion of nonstructural gene sequences from other strains and genotypes results in a 
functional replicon if sequences for NS5B and 3NTR region are from the same strain, for 
example BB7. 

The invention further provides that other NS5B sequences can replace BB7 NS5B 
sequence, if paired with their cognate 3NTR sequence elements. Moreover, certain 
sequence changes in the BB7 3*NTR, compared to changes in the J4 3NTR indicate the 
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basis of NS5B:3'N'rR interaction, and thereby confer stable replication in Huh-7 cells. 
Figure 1 shows an alignment of 3NTR sequences from different strains. 

Various definitions are used throughout this document. Most words have the meaning 
that would be attributed to those words by one skilled in the art. Words specifically defined 
either below or elsewhere in this document have the meaning provided in the context of the 
present invention as a whole and as typically understood by those skilled in the art. 

As used herein, the term "replicon" refers to a viral nucleic acid that is capable of 
directing the generation of copies of itself. As used herein, the term "replicon M includes 
RNA as well as DNA, and hybrids thereof. For example, double-stranded DNA versions of 
HCV genomes can be used to generate a single-stranded RNA transcript that constitutes an 
HCV replicon. Generally, a viral replicon contains the complete genome of the virus. 
"Sub-genomic replicon," as used herein, refers to a viral nucleic acid that contains 
something less than the full complement of genes and other features of the viral genome, yet 
is still capable of directing the generation of copies of itself. For example, the sub-genomic 
replicons of HCV described below contain most of the genes for the non-structural proteins 
of the virus, but are missing most of the genes coding for the structural proteins. Sub- 
genomic replicons are capable of directing the expression of all of the viral genes necessary 
for the replication of the viral sub-genome, replication of the sub-genomic replicon, without 
the production of viral particles. 

An HCV sub-genomic replicon, may be derived from any of the various HCV strains 
and isolates, such as, but not limited to, any of the isolates from genotypes 1, 2, 3, 4, 5 or 6 
of HCV. Moreover, the various genes included in the sub-genomic replicon can be derived 
from different strains. The complete genotypes of many of these strains are known. See, 
e.g., U.S. Patent No. 6,150,087 and GenBank Accession Nos. AJ238800 and AJ238799, 
International Publication Nos. WO 89/04669; WO 90/1 1089; and WO 90/14436. Moreover, 
the genes included in the sub-genomic replicon may be full-length, fragments or variants of 
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the native sequence, so long as the sub-genomic replicon remains capable of expressing the 
viral genes necessary for replication thereof, without producing viral particles. Thus, for 
example, the genes included in the sub-genomic replicon may be homologous to the native 
genes. 

The phrase "stably replicating" as used herein in reference to the sub-genomic replicons 
means the steady, continuous generation of new sub-genomic replicons in the cells into 
which initial sub-genomic replicon transcripts are introduced, as well as their progeny cells. 
The transfected cells continue to proliferate, and the sub-genomic replicons continue to 
replicate. 

The term "cell" as used herein refers to single cells as well as to the collection of cells 
in culture derived from a single progenitor cell, otherwise referred to as cell lines. 

As used in this specification and the claims, the singular forms "a," "an" and "the" 
include plural references unless the content clearly dictates otherwise. Thus, for example, 
reference to "a cell" includes a mixture of two or more cells. 

Many different cell types can be used to practice the invention. In a preferred 
embodiment of the invention, the cell used is a human liver cell. More preferably, the cell is 
a hepatocellular carcinoma-derived cell. In a non-limiting example, the cell is Huh-7 
(Nakabayashi etaL, 1982, Cancer Res., 42:3858-3863; Seki etaL, 1999, 
Hepatogastroenterology, 46:2812-2817). In another non-limiting example, the cell is 
HepG2 (U.S. Patent No. 4,393,1 33). Other cell lines in which the invention may be 
practised include, but are not limited to, myc immortalized human liver cell lines, and 
primary cultures of fetal hepatocytes (Sanchez et al. 9 1 995, J. Cell Physiol, 165:398- 405). 

Another aspect of the invention provides methods of screening for compounds that 
modulate replication of viral RNAs either directly or indirectly. Compounds can be 
screened for their effect on the replication of sub-genomic viral replicons in the cells of the 
invention by treating the cells with test compounds. Compounds that target the viral genes 
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and/or proteins involved in the replication of sub-genomic viral repl icons can be identified 
in screens of the invention where an HCV sub-genomic replicon can stably replicate. 

The cells of the present invention can be used to identify compounds that inhibit viral 
RNA replication, and hence, viral replication, or to identify compounds that enhance viral 
RNA replication, and hence, viral replication. In particular, compounds identified as having 
inhibitory effects on the replication of HCV sub-genomic replicons will be candidates for 
use as drugs in the treatment of HCV infection and disease. Compounds exhibiting 
replication-enhancing activities will be candidates for use in the development of further 
cellular and animal model systems of HCV replication. 

There are a variety of HCV targets for test compounds, including, but not limited to, 
HCV internal ribosomal entry sites, HCV NS3 serine proteinase, NS3 RNA helicase, NS5B 
RNA dependent RNA polymerase, and other HCV non- structural proteins. For example, 
compounds may interfere with the process of viral replicon replication by interfering with 
the viral proteins that are critical to RNA replication, all of which are translated off of the 
transcripts being generated in HCV sub-genomic replicon-containing cells of the present 
invention. 

When a selectable drug resistance marker is included in the sub-genomic replicon, 
compounds can be assessed for their ability to sensitize cells to the selectable drug, i.e., to 
render the cells sensitive to the drug that was used to select them. Test cultures where cells 
die off are indicative of compounds that interfere with replicon replication, because loss of 
the drug-selectable replicon renders the cells sensitive to that particular drug. Where, for 
example, a neo resistance marker is used in conjunction with the viral sub-genomic replicon, 
loss of the neo-selectable, sub- genomic replicon will render the cells sensitive to G418. 

As used herein, the term "compound 0 means any identifiable chemical or molecule, 
including, but not limited to small molecules, peptides, polypeptides, proteins, sugars, 
nucleotides, or nucleic acids. Such compounds can be natural or synthetic. 
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As used herein, the term "modulates" in reference to host replication activity means 
results in a change in the amount, quality, or effect of a particular response or activity. Both 
increases and decreases in the response or activity are included. 

As used herein, the term "chimeric" means a molecule of RNA, DNA, or protein that 
has resulted from recombination, or has resulted from DNA from two sources fused or 
spliced together. 

The invention is further illustrated by way of the following examples which are 
intended to elucidate the invention. These examples are not intended, nor are they to be 
construed, as limiting the scope of the invention. It will be clear that the invention may be 
practised otherwise that as particularly described herein. Numerous modifications and 
variations of the present invention are possible in view of the teachings herein and, 
therefore, are within the scope of the invention. 

EXAMPLES 

Example 1. Construction of replicons comprising HCV type la H77 sequence. 

The present invention utilized PCR, in which pCV-H77C DNA was used as template to 
amplify a DNA fragment (Fl) which contains NS3, starting at amino acid 76, NS4A, NS4B, 
and the N terminal of NS5A, ending at amino acid 148. The primers are designed according 
to the pCV-H77C DNA sequence with only one nucleotide change in each of the 5' and 3' 
primers. A single T to C change in the 5' primer BG1000 incorporated a BsrGI restriction 
site while a single T to C change in the 3' primer BG1002 created an Eco RI restriction site. 
Neither changes in the DNA sequence caused changes in the encoded amino acid sequences. 
The PCR product was digested with Eco RI and Bsr GI restriction enzymes and cloned into 
pHCVreplb(BB7) vector that has been digested with the same enzymes. The resulting 
chimeric construct, pBB7-Fl was selected based on the lack of a Mlu I restriction site in the 
substituted H77 sequence and confirmed by DNA sequencing analysis. 
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PCR was utilized to amplify DNA fragment F2 by using H77 DNA as template and the 
primers BG1001 and BG1004. F2 extends from within NS5A (starting at amino acid 149) 
to NS5B (ending at amino acid 132), encompassing the SI 1791 adpative mutation. BG1001 
is complementary to BG1002 and contained a single A to G nucletide change and an Eco RI 
restriction site. Similarly, a T to A change was built in BG1004 from the H77 sequence to 
create an Mfe I restriction site. To make the SI 1791 adaptive mutation, the 1.3 kb F2 was 
digested with Eco RI and Mfel and cloned into the corresponding sites in pLitmus38 and 
subjected to oligonucleotide-directed mutagenesis with laS9484I and laS9484I R. The 
mutagenized fragment was subsquently cloned into pBB7-Fl that has been digested with 
Eco RI and Mfe I to produce pBB7-Fl/F2. 

An additional DNA fragment was amplified from H77 DNA by PCR to generate an 
F3(c) fragment containing H77 HCV la sequence from NS5B, at the end of F2, starting at 
amino acid 133, to the 3* end of HCV genome. The 5' primer BG1003 contains an Mfel site 
and the 3* primer RB8000 contains a Spe I restriction site followed by a Sea I site. The Seal 
site marks the end of the HCV genome in that after Sea I digestion and transcription the 
authentic RNA 3' was produced. The RB8000 oligonucleotide has two changes from H77 in 
sequence, an A to T change at the - 3 position and A to T change at the - 44 position. The 
changes reflected the DNA sequence in pBB7 and prevserved the base-pairing structure of 
the 3' X tail (Yanagi, M etal. 1999). F3(c) fragment was digested with Mfe I and Spe I and 
purified from agose gels. It was ligated into pBB7-Fl/F2 vector that was digested with Mfe 
I and Spe I to geneerate pBB7-Fl/F2/F3(c) or HCV 1 A replicon. The F3 (C) fragment was 
also directly cloned into pHCVreplb(BB7) to replace the corresponding DNA to make 
pBB7-F3(c). Similarly, Mfe I -Spe I fragment without the single nucleotide change F3 was 
cloned into pHCVreplb(BB7) to generate pBB3-F3. 

A replicon containing HCV type la NS5B, pBB7/H77NS5B, was constructed in 
multiple steps. First, a unique Sna BI site was introduced into pHCVreplb(BB7) at the end 
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of the NS5B gene. To accomplish this, two DNA fragments were amplified by using 
pHCVreplb(BB7) as template and the primer pairs: rb6000 
CGTCTGCTGCTCGATGTCCTAC and RB7801 3' [SEQ ID NO: 15] 

CTCCCCC AACCGATGAACGGG£ACGTAAAC ACTC C AGGCC AATAG ; BG1005 [SEQ ID NO: 16] 
GCACTAGTACTTGATCTGCAGAGAGGC and RB7801 5* [SEQ ID NO: 17] 
CTATTGGCCTGGAGTGTTTACGTACCCGTTCATCGGTTGGGGGAG. [SEQ ID NO: 1 8] 

RB7801 5* and RB7801 3' are complementary primers derived from sequences at the end 
of NS5B. Two nucleotide sequences (underlined sequences) were changed from the BB7 
HCV sequence so that a Sna BI site is created. The sequence changes do not affect the 
NS5B sequence since they are 3' to the NS5B coding region. By using RB7801 3' and 
rt>6000, a 1.4 kb NS5B fragment was generated. By using RB7801 5' and BG1005, a 220 
bp 3NTR sequence was generated. The two fragments were purified, annealed and the 
annealed product used as templates in PCR reactions to generate NS5B-3TSFTR fragment. 
This fragment was digested with Bel I and Spe I and the resulting DNA ligated into pBB7 
vector that were digested with the same restriction enzymes. The resulting plasmid, 
designated pBB7-SN, was confirmed to contain a Sna BI restriction site at the end of NS5B 
by restriction digestion and DNA sequecing. The NS5B gene fragment from pBB7-SN 
DNA was then replaced with other NS5B genes by cloning into the Bell and SnaBI 
restriction sites. Replicon RNA transcribed from pBB7-SN can replicate efficiently when 
electroporated into Huh-7 cells. To generate pBB7/H77NS5B, the HCV type la H77 NS5B 
gene was first amplified by using H77 DNA and primers RB7801 3* and 1 A 10501 
CCTGGACAGGCGCACTGATCACC [SEQ ID NO: 19]. Two nucleotides were changed 
from the type la sequence to create a Bel I site which caused a V to I substitution in H77 
NS5B. After digestion with Bel I and Sna BI, the H77 NS5B fragment was ligated to 
pBB7-SN vector prepared by digestion with the same restriction enzymes. Replacement of 
BB7 NS5B by H77 NS5B was confirmed by the introduction of an Eco RI restriction site. 
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Similarly, pBB7/J4NS5B was constructed by cloning into pBB7-SN PCR fragments 
generated by using the primer pairs rb6000 and RB7801 3' and pCVJ4L6S DNA. 

For synthesis of all other hybrid J4 replicons, multiple cloning steps were used. First, 
pCVJ4L6S DNA was digested with BsrG I and Mfe I and the 4.3 kb DNA fragment 
encoding NS3 (from amino acid 75), NS4A, 4B, NS5A, and the N terminal 132 amino acids 
of NS5B was purified from agrose gels. The fragment was ligated with pHCVreplb(BB7) 
vector digested with the same restriction enzymes to generate pBB7-J4. pBB7-J4 was 
distinguished from pHCVreplb(BB7) by Hpa I digestion. pBB7-J4 has only one Hpal site 
while pHCVreplb(BB7) has two. To put together the J4 replicon, primers 
J4-10861: GAGGACTTGCTGGAAGACACTG [SEQ ID NO:20] 
and BB7980: CAGGAGTACTTGATCTGCAGAGAGGC [SEQ ID NO:21] 
was used in PCR reactions to amplify the J4 DNA fragment 3' to the Mfe I site. The DNA 
was digested with Mfe I and Sea I restriction enzymes and cloned into pBB7-J4 to create 
pJ4/BB7. To add the NS5A adapatation mutation SI 1791 in pHCVreplb(BB7) to 
pJ4/BB7, the 1.3 kb Eco RI to Mfe I DNA fragment encoding part of NS5A surrounding the 
S 1 1791 was subcloned into pLitmus 38. The resulting plasmid was subjected to 
oligonucleotide-directed mutagenesis to introduce the SI 1791 change by using the oligos J4- 
984 IT and J49841TR. The mutagenized Eco RI - Mfe I fragment was used to replace the 
wildtype fragment in pJ4/BB7 to generate pJ4replicon. Regions containing the mutation in 
pJ4replicon were sequenced and both the J4 sequences and the SI 1791 mutation were 
confirmed. The pJ4replicon DNA also has the single nucleotide change (A to T at -40 
position) in its 3'NTR X tail. This was fixed by first generating a PCR fragment by using 
the primer pairs BB8000 and J4- 10861 and then replace the Mfe I -Spe I fragment in pJ4 
replicon with the PCR fragment. The replicon with the corrected 3* X tail is designated pJ4 
replicon(c). The Mfe I- Spe I and BsrGI - Eco RI fragments from pHCVreplb(BB7) were 
used to replace the corresponding sequence in pJ4 replicon(c) to make pJ4M/S and pJ4 
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B/Rl (c), respectively. The BsrGI-EcoRI fragment from pHCVreplb(BB7) was also used 
on pJ4replicon to make pJ4 B/RI, which differs from pJ4B/RI(c) by a mutation in the 3'X 
tail. 

RNA transcribed from the above replicon DNA was electroporated into Huh-7 cells 
according to the procedures in the examples. G418 resistant cells were obtained by 
incubating the cells in G418-containing media. Once the cells were amplified, total cellular 
protein was separated on SDS-PAGE and probed by NS5A serum to assess whether HCV 
proteins were expressed. As predicted, all three replicon cell lines expressed NS5A as 
shown on figure. Genomic DNA and RNA were also isolated from each replicon cell lines. 
PCR reactions on genomic DNA were carried out to assess the possibility that the G418 
resistance was due to expression of the phosphatransferase from integrated neo resistant 
gene into the Huh-7 cell genome. As shown in figure 4, while the PCR reaction amplified a 
fragment of neo DNA when using genomic DNA from a neo resistant cell line was used as 
substrate, no neo specific product was amplified from equivalent amount of genomic DNA 
from pJ4 M/S, pBB7-Fl and pHCVreplb(BB7). However, when the same genomic DNA 
was used to amplify a GAPDH DNA fragment, the expected sized DNA was amplified 
readily, suggesting that the genomic DNA is of good quality. Therefore, the date indicate 
that it is unlikely that the resistance is due to integration of residue DNA in our RNA 
preparation into the genome of Huh-7 cells. 

Next, replicon RNA was quantified by TaqMan. Real-time PCR was conducted with 
primers and probes in the neo gene on total RNA isolated from the replicon cells and on 
standard neo containing DNA. Copy numbers of the replicon were caculated based on 
standard curve generated from known amount of in in vitro transcribed RNA. J4 M/S 
contained about 147 copies/cell, BB7- Fl contained 57 copies/cell, HCV la replicon has 
165 copies/cell, BB7/H77NS5B has 29 copies/cell. For a positive control, HCV BB7 
replicon that has 1500 copies/cell while Huh-7 cells served as negative controls. 
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Example 2. Cells, media, DNA cloning. 

Huh-7 cell lines were cultured in Dulbecco's Modified Eagle Media (DMEM) 
(Invitrogen #11965-084) containing, 10% fetal calf serum (FCS) (JRH Biosciences #12103- 
78P), 1% penicillin-streptomycin (P-S) (Invitrogen #15140-122), 1% non-essential amino 
acids (NEs) (Invitrogen #1 1 140-050), and 1 mg/ml Geneticin (herein ,, G418 M ) (Invitrogen 
11811-023). 

Example 3. Construction of hybrid H77 replicons. 

PCR was carried out to amplify a DNA fragment (Fl) which contains NS3 (from 
amino acid 76), NS4A, 4B and the N terminal of NS5 A ( 148 amino acids) by using HCV-H 
type la DNA as substrate. The 100 ul reaction contains lx pfu PCR buffer (Strategene), 
0.25 uM BG1000, 0.25 uM BG1002 primers, 10 ng HCV type 1A DNA, 2 units turbo pfu 
DNA polymerase. The PCR conditions were set at 95C for 30 seconds, 60C for 30 seconds, 
72C for 3 minutes for 25 cycles. The DNA product was purified from an agrose gel by 
Qiagen Quick gel kit, digested with BsrGI and EcoRI restriction enzymes. After 
purification, the DNA frgment was ligated into pHCVreplb(BB7) vector DNA that was 
previously digested with BsrGI and EcoRI enzymes. The recombinant plamids were 
distingushed from backgroud pHCVreplb(BB7) plamids by DNA digestion with Mlu I. 
pHCVreplb(BB7) has a Mlu I site in the substituted region, while recombinant pBB7 Fl 
lacks the Mlu I site. pH77 Fl DNA was prepared and sequence confirmed to contain type 
1A sequences. 

PCR reactions were carried out to amplify another DNA fragment (F2) by using H77 
DNA as template and the primers BG 1001 and BG1004. F2 extends from within NS5A to 
the 132 amino acid of NS5B, encompassing the SI 1791 adaptive mutation. BG1001 is 
complemenatry to BG1002, thus containing a single nucleotide change, A to G, and an Eco 
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RI restriction site. Similarly, a T to A change was built in BG1004 from the H77 sequence 
to create a Mfe I restriction site. The 1.3 kb PCR product was digested with Eco RI and 
Mfel and cloned into the corresponding sites in pLitmus38 and subjected to oligonucleotide- 
directed mutagenesis with the oligos J4-9841T and J49841TR. The mutagenized fragment 
was subsequently cloned into pBB7-Fl that has been digested with Eco RI and Mfe I to 
produce pBB7-Fl/F2. 

Another DNA fragment was amplified from H77 DNA by PCR to generate a F3(c) 
fragment which contains H77 HCV 1 A sequence from the end of F2 to the 3* end of HCV 
genome. The primers are BG1003, which was designed to contain an Eco RI site and 
RB8000, which contain a Spe I restriction site right next to a Sea I site. The Seal site marks 
the end of the HCV genome in that after Sea I digestion and transcription the authentic RNA 
3' was produced. The oligonucleotide has two changes from H77 in sequence, an A to T 
change at the minus 3 position and T to A change at the minus 44 position. The changes 
reflected the DNA sequence in pBB7 and preserved the base-pairing structure of the 3'X 
tail (Yanagi, M et al. 1999). F3(c) fragment was digested with Mfe I and Spe I and purified 
from agarose gel. Then ligated into pBB7-Fl/F2 vector that was digested with Mfe I and 
Spe I to generate pBB7-F1/F2/F3(c) or HCV 1A replicon. The F3(c) fragment was also 
ligated into pBB7 vector to generate pBB7-F3(c). Another construct was generated by 
cloning an F3 fragment containing the mismatch in the 98 nt x-tail. The -44 position was 
left to have the T, but the -3 position was changed to T to create the Sea I site. This 
construct is designated pBB7-F3. 
Table 1. 

The restriction sites and there positions as used in the construction of the replicons. 

BsrGI nt 2024 (NS3 aa#75) 

Eco RI nt 5083 (NS5A aa#148) 

Mfe I nt 6383 (NS5B aa#l 32) 

Sna BI nt 7764 (3'NTR) 
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Table 2. 

Summary of nucleotide and amino acid changes in HCV la replicon and BB7/H77NS5. 
Changes in HCV la replicon: 



Position 



Nucleotide Change 



Amino acid change 



I. BB7 adaptive mutation: 
5336 GtoT 



NS5A S232I 



II. HCV la adaptive muation: 

2797 TtoC 

3550 A to G 

4682 AtoT 

6520 G to A 

6936 C to A 

7356 G to A 



NS3 S332P 
NS3 K583E 
NS5AE14V 
NS5B V179I 
none 
none 



III, changes made during cloning: 

2028 T to C (create BsrGI site) none 

5088 TtoC (create Eco RI site) none 

6387 A to T (create Mfe I site) none 



Mutations in BB7/H77NS5B: 

6012 CtoG NS5BV11I 

6013 G to A NS5BV11I 
7024 GtoT NS5B A348S 

Example 4. Construction of hybrid J4 replicons. 

pHCV J4 DNA was digested with BsrG I and Mfe I and the 4.3 kb DNA fragment 
containing type IB NS3 (from amino acid 75), NS4a, 4B, NS5A, and the N terminal 132 
amino acids of NS5B was purified from agrose gels. The fragment was cloned into 
pHCVreplb(BB7) vector DNA that was digested previously with the same restriction 
enzyme to generate pJ4M/S*. To add the NS5A adapatation mutation SI 1791 in 
pHCVreplb(BB7) to pJ4 M/S* t a 1.3 kb Eco RI to Mfe I DNA fragment from 
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pHCVreplb(BB7) was subcloned into pLitmus 38 and subjected to oligonucleotide-directed 
mutagenesis. The oligonucleotides 

J4-9841T: CTTTAGCCAGCTCATCAGCTATCCAGTTGTCTGCGCCTTC 
[SEQ ID NO:22] and 

J49841TR:GAAGGCGCAGACAACTGGATAGCTGATGAGCTGGCTAAAC 
[SEQ ID NO:23] were used in PCR reactions according to the manufacturer's 
directions. The mutagenized Eco RI- Mf el fragment was then cloned into pJ4M/S* 
to generate pJ4M/S. DNA of pJ4M/S was sequenced to confirm both the J4 
sequences and the S 1 1791 mutation. 

Example 5. In Vitro Transcription and DNase digest of Replicon RNA. 

Replicon plasmid DNA was linearized with Sea I (New England Biolabs) using 
1U/1 Dg DNA for 1 hour at 37°C. The linearization of the plasmid generates the authentic 3' 
end of the HCV 3NTR and provides the template to generate the appropriate RNA 
transcript from the upstream T7 promoter. The DNA was subsequently purified in two 
phenol:chloroform:IAA extractions, followed by an EtOH precipitation. The linearized 
DNA pellet was dried and resuspended in nuclease-free water for use in an in vitro 
transcription reaction. The in vitro transcription reaction was performed using the 
Megascript T7 Kit (Ambion). A total of 1 ug of linearized DNA was aliquoted into a 
solution containing a final concentration of lx Reaction Buffer, Ix Enzyme Mix and 7.5 
mM of ATP, UTP, CTP and GTP. The reaction was incubated at 37°C for 4 hours in an air 
incubator. Following the incubation, 1U/1 Dg DNA of RQ1 DNase enzyme (Promega) was 
added to the reaction and incubated at 37°C for 2 hours. The newly synthesized RNA was 
purified using the RNeasy Kit (Qiagen). Following the manufacturer's protocol, the RNA 
was bound to the RNeasy column and an additional DNase digest was performed using 1U 
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of RNase-free DNase (Qiagen) for 1 hour at 25°C. The RNA was eluted in nuclease free 

water and stored at 

-80°C. 

Example 6. Electroporation of RNA into Huh-7 cells. 

Five Dg of replicon RNA was electroporated into 5x1 0 6 cells of the hepatoma cell 
line, Huh-7. Briefly, Huh-7 cells were passed the day before the electroporation and grown 
to 50% confluency. Huh-7 cells were pelleted, washed, counted and resuspended to a final 
concentration of IxlO 7 cells/ml in Opti-MEM media (Invitrogen). A total of 5 Dg of RNA 
was aliquoted into a pre-chilled cuvette on ice. A 0.2 ml aliquot of the cell suspension was 
then added to the cuvette containing the RNA. The RNA was immediately electroporated 
into the Huh-7 cells using the Gene Pulser (Bio-Rad), adjusted to deliver one pulse of 0.2 
kV, 100 ohms and 960 DFD. The cuvettes were chilled on ice for 10 minutes to allow the 
cells to recover. The cells were plated on a 100 mm tissue culture plate in complete media. 
Selection media containing either 0.5 to 1.0 mg/ml of G418 (Invitrogen) was added to the 
cells 24-48 hours post-electroporation. 

Example 7. Analysis of replicons by western blot analysis. 

Total cell lysates were harvested from replicon cells in l.lx LDS Buffer (Invitrogen). 
The lysates were heated for 10 minutes in the presence of a reducing agent. The lysates 
were run on a 10% Bis-Tris NuPage polyachrylamide gel (Invitrogen) in lx MOPS buffer as 
recommended by the manufacturer. Protein transfer to PVDF (Invitrogen) membrane was 
performed using the Semi-Dry Trans Blot System (Bio-Rad). Following transfer, the blot 
was rinsed once with PBS containing 0.5% Tween-20 (PBS-Tween). The blot was blocked 
in PBS-Tween containing 5% non-fat milk for Ihour at room temperature with gently 
agitation. The blot was rinsed quickly with two washes of PBS-Tween, followed by two ten 
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minute PBS-Tween washes with gentle agitation. The primary antibody incubation to detect 
HCV NS5a protein utilized an 0-NS5a rabbit polyclonal antibody dilued 1 :2500 in PBS- 
Tween. The incubation was conducted for 1 hour at room temperature with gentle agitation. 
The D-NS5a antibody was removed and the blot was rinsed quickly with two washes of 
PBS-Tween, followed by two 10 minute PBS-Tween washes with gentle agitation. A 
secondary antibody incubation was performed using a HRP conjugated □ -rabbit IgG 
antibody (Amersham) diluted in 1:5000 in PBS-Tween. Again, the incubation was done at 
room temperature for 1 hour with gentle agitation. Following the secondary antibody 
incubation, the blot was rinsed quickly with two washes of PBS-Tween, followed by two ten 
minute PBS-Tween washes with gentle agitation. The blot was incubated in Super Signal 
Chemiluminescent Reagent (Pierce) following the manufacturer's protocol and exposed to 
film (Amersham). 



Example 8. Isolation of genomic DNA. 

Genomic DNA was isolated using DNeasy Tissue Kit (Qiagen) according to the 
manufacturer's protocol. The experiments to determine possible genomic incorporation of 
the neomycin resistance gene was performed using PCR. Primers Ralf Neo 5'TCA AGA 
CCG ACC TGT CCG GTG CCC [SEQ ID NO:24] 

and Ralf Neo 3'CTT GAG CCT GGC GAA CAG TTC GGC [SEQ ID NO:25] 
to amplify a 380 bp product form 100 ng of genomic DNA. All PCR reactions utilized 2.5 
U of PFU Turbo polymerase (Stratagene) per 100 ul reaction. The PCR reactions contained 
a final primer and dNTP concentration of 250 nM and 100 DM, respectively. In addition, 
the final buffer concentrations comprised 20 mM Tris-HCl (pH 8.8), 2mM MgS0 4 , 10 mM 
KC1, 10 mM (NH4) 2 S0 4 , 0.1% Triton X-100, and 0.1 mg/ml nuclease free BSA. Control 
primers GAPDH for ACC ACA GTC CAT GCC ATC AC [SEQ ID NO:26] 
and GAPDH rev TCC ACC ACC CTG TTG CTG TA [SEQ ID NO:27] 
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were used to demonstrate the presence of intact genomic DNA in each PCR reaction. 



Example 9. Analysis of replicon copy number by TaqMan. 

Cells were trypinized and counted to determine cell number. The total cellular RNA was 
isolated using RNeasy Mini Kit (Qiagen) following the manufacturer's protocol. The RNA 
was stored at -80°C or assayed by TaqMan quantitative RT-PCR. For analysis by TaqMan 
PCR, a master mix was prepared using TaqMan EZ RT-PCR Kit (Applied Biosystems 
#403028), TaqMan PDAR Control Reagent Human Cyclophilin (Applied Biosystems 
#4310883E), custom primer Neo R fwd CCG GCT ACC TGC CCA TTC [SEQ ID NO:28] 
(Invitrogen), custom primer Neo R rev CCA GAT CAT CCG ATC GAC AAG [SEQ ID 
NO:29] (Invitrogen), and custom probe Neo R probe 5' FAM-ACA TCG CAT CGA GCG 
AGC ACG TAC-TAMRA 3' [SEQ ID NO:30] (Biosource International). Master mix 
concentrations were as follows: IX TaqMan EZ Buffer, 3 mM Mn(Oac) 2 , 0.3 mM dATP, 
0.3 mM dCTP, 0.3 mM dGTP, 0.6 mM dUTP, 0.2 mM Neo R fwd, 0.2 mM Neo R Rev, 0.1 
mM Neo R probe, IX Cyclophilin Mix, 0.1 Unit/Dl rTth DNA Polymerase, 0.01 Unit/Dl 
AmpErase UNG, and H 2 0 to 40 □!. A total of 40 Dl of master mix was added to wells of a 
96-tube optical plate (Applied Biosystems #N80 1-0560), followed by 10 Dl of RNA sample 
at a concentration of 5 ng/Dl. The plate was covered with an optical adhesive cover 
(Applied Biosystems #431 1971) and mixed by repeated inversion several times, followed by 
a brief centrifuge spin. The plate was placed in an ABI7700 (Applied Biosystems) and the 
entire plate was set to FAM dye layer for unknown, and to the VIC dye layer for control. 
The cycling parameters were set to 50°C, 2 min.; 60°C, 30 min.; 95°C, 5 min.; (94°C, 20 
sec; 55°C, 1 min.) 40 cycles, with an exposure time of 10 milliseconds and spectral 
compensation. Data was analyzed by leaving the baseline to default level on both layers, 
and set a threshold to an equal value for both dye layers. The data was exported to 
Microsoft Excel for further analysis. To determine replicon copy number per cell, a linear 

26 



WO 03/085084 




PCT7US03/10177 



regression curve for the known copy number Neo R standards was established by plotting the 
Cycle Threshold values (Ct) versus the log of copy number for each Neo R standard. The 
copy number for each replicon sample was calculated by taking the sample's Ct value, 
minus the line intercept, divided by the slope of the line. 



Table 3. Polynucleotide Sequences 

(A) Sequences from HCV H77 Replicons 

BB7-F1 [SEQ ID NO:2] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 

CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTC AGCGC AGGGGCGCCCGGTTL: ill 1 1 GTCAA 

GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCITCIIGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 
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GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTAcACCAATGTGGACCAAGACCTTGTGGG 
CTGGCCCGCTCCTCAAGGTTCCCGCT 

CATTGACACCCTGTACCTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 
CCGATGTCATTCCCGTGCGCCGGCGA 

GGTGATAGCAGGGGTAGCCTGCrrTCGCCCCGGCCCATTTCCTACTTGAAAGGC 
TCCTCGGGGGGTCCGCTGTTGTGCCC 

CGCGGGACACGCCGTGGGCCTATTCAGGGCCGCGGTGTGCACCCGTGGAGTGG 
CTAAAGCGGTGGACTTTATCCCTGTGG 

AGAACCTAGGGACAACCATGAGATCCCCGGTGTTCACGGACAACTCCTCTCCAC 
CAGCAGTGCCCCAGAGCTTCCAGGTG 

GCCCACCTGCATGCTCCCACCGGCAGCGGTAAGAGCACCAAGGTCCCGGCTGC 
GTACGCAGCCCAGGGCTACAAGGTGTT 

GGTGCTCAACCCCTCTGTTGCTGCAACGCTGGGCTTTGGTGCTTACATGTCCAAG 
GCCCATGGGGTTGATCCTAATATCA 

GGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCTACG 
GCAAGTTCCTTGCCGACGGCGGGTGC 

TCAGGAGGTGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCC 
ACATCCATCTTGGGCATCGGCACTGT 

CCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACTGCTAC 
CCCTCCGGGCTCCGTCACTGTGTCCC 

ATCCTAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCCTTTTACG 
GCAAGGCTATCCCCCTCGAGGTGATC 
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AAGGGGGGAAGACATCTCATCTTCTGCCACTCAAAGAAGAAGTGCGACGAGCT 
CGCCGCGAAGCTGGTCGCATTGGGCAT 

CAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCTGTCATCCCGACCAGCGG 
CGATGTTGTCGTCGTGTCGACCGATG 

CTCTCATGACTGGCTTTACCGGCGACTTCGACTCTGTGATAGACTGCAACACGT 
GTGTCACTCAGACAGTCGATTTCAGC 

CTTGACCCTACCTTTACCATTGAGACAACCACGCTCCCCCAGGATGCTGTCTCCA 
GGACTCAACGCCGGGGCAGGACTGG 

CAGGGGGAAGCCAGGCATCTATAGATTTGTGGCACCGGGGGAGCGCCCCTCCG 
GCATGTTCGACTCGTCCGTCCTCTGTG 

AGTGCTATGACGCGGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAGACTACAG 
TTAGGCTACGAGCGTACATGAACACC 

CCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACG 
GGCCTCACTCATATAGATGCCCACTT 

TTTATCCCAGACAAAGCAGAGTGGGGAGAACTTTCCTrACCTGGTAGCGTACCA 
AGCCACCGTGTGCGCTAGGGCTCAAG 

CCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATCCGCCTTAAACCCA 
CCCTCCATGGGCCAACACCCCTGCTA 

TACAGACTGGGCGCTGTTCAGAATGAAGTCACCCTGACGCACCCAATCACCAAA 
TACATCATGACATGCATGTCGGCCGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTCT 
GGCCGCGTATTGCCTGTCAACAGGCT 

GCGTGGTCATAGTGGGCAGGATCGTCTTGTCCGGGAAGCCGGCAATTATACCTG 
ACAGGGAGGTTCTCTACCAGGAGTTC 

GATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGAT 
GCTCGCTGAGCAGTTCAAGCAGAAGGC 

CCTCGGCCTCCTGCAGACCGCGTCCCGCCATGCAGAGGTTATCACCCCTGCTGT 
CCAGACCAACTGGCAGAAACTCGAGG 

TCTTTTGGGCGAAGCACATGTGGAATTTCATCAGTGGGATACAATACTTGGCGG 
GCCTGTCAACGCTGCCTGGTAACCCC 

GCCATTGCITCATTGATGGCTTTTACAGCTGCCGTCACCAGCCCACTAACCACTG 
GCCAAACCCTCCTCTTCAACATATT 

GGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGT 
GGGTGCTGGCCTAGCTGGCGCCGCCA 

TCGGCAGCGTTGGACTGGGGAAGGTCCTCGTGGACATTCTTGCAGGGTATGGCG 
CGGGCGTGGCGGGAGCTCTTGTAGCA 

TTCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTGCTG 
CCCGCCATCCTCTCGCCTGGAGCCCT 

TGTAGTCGGTGTGGTCTGCGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGA 
GGGGGCAGTGCAATGGATGAACCGGC 

TAATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGCCGG 
AGAGCGATGCAGCCGCCCGCGTCACT 
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GCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCATCAGTGG 
ATAAGCTCGGAGTGTACCACTCCATG 

CTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGCTGAGCGA 
CTTTAAGACCTGGCTGAAAGCCAAGC 

TCATGCCACAACTGCCTGGGATTCCCTTTGTGTCCTGCCAGCGCGGGTATAGGG 
GGGTCTGGCGAGGAGACGGCATTATG 

CACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGAC 
GATGAGGATCGTCGGTCCTAGGACCTG 

CAGGAACATGTGGAGTGGGACGTTCCCCATTAACGCCTACACCACGGGCCCCTG 
TACTCCCCTTCCTGCGCCGAACTATA 

AGTTCGCGCTGTGGAGGGTGTCTGCAGAGGAATACGTGGAGATAAGGCGGGTG 
GGGGACTTCCACTACGTATCGGGTATG 

ACTACTGACAATCTTAAATGCCCGTGCCAGATCCCATCGCCCGAATTCTTCACA 
GAAGTGGATGGGGTGCGGTTGCACAG 

GTACGCTCCAGCGTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGG 
GCTCAATCAATACCTGGTTGGGTCAC 

AGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCG 
ACCCCTCCCACATTACGGCGGAGACG 

GCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCT 
ATCCAGCTGTCTGCGCCTTCCTTGAA 

GGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCCTGTGGCGGCAGGAGATGGGCG 

GGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGACTCTTTCG 
AGCCGCTCCAAGCGGAGGAGGATGAG 

AGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGA 
GCGATGCCCATATGGGCACGCCCGGA 

TTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCAGT 
GGTACACGGGTGTCCATTGCCGCCTG 

CCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAG 
AATCTACCGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACAGCGG 
CACGGCAACGGCCTCTCCTGACCAGCC 

CTCCGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAGGGGGAGCCGGGGGATCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGTCG 
TCTGCTGCTCGATGTCCTACACATGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAACCAAGCTGCCCATCAA 
TGCACTGAGCAACTCTTTGCTCCGTCA 

CCACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAA 
GGTCACCTTTGACAGACTGCAGGTCC 

TGGACGACCACTACCGGGACGTGCTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAACTTCTATCCGTGGAG 
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GAAGCCTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGGACTTGCTGGAAGACACTGA 
GACACCAATTGACACCACCATCATGG 

CAAAAAATGAGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCT 
CGCCTTATCGTATTCCCAGATTTGGGG 

GTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTCTCCACCCTCCCTCAG 
GCCGTGATGGGCTCTTCATACGGATT 

CCAATACTCTCCTGGACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAA 
GAAATGCCCTATGGGCTTCGCATATG 

ACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGTGTTGAGGAGT 
CAATCTACCAATGTTGTGACTTGGCC 

CCCGAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGGG 
CCCCCTGACTAATTCTAAAGGGCAGAA 

CTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAGGCCG 

CTGCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACGATGCTCGTATGCGGAG 
ACGACCTTGTCGTTATCTGTGAAAGC 

GCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGAC 
TAGATACTCTGCCCCCCCTGGGGACCC 

GCCCAAACCAGAATACGACTTGGAGTTGATAACATCATGCTCCTCCAATGTGTC 
AGTCGCGCACGATGCATCTGGCAAAA 

GGGTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGG 
AGACAGCTAGACACACTCCAGTCAAT 

TCCTGGCTAGGCAACATCATCATGTATGCGCCCACCTTGTGGGCAAGGATGATC 
CTGATGACTCATTTCTTCTCCATCCT 

TCTAGCTCAGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTG 
TTACTCCATTGAGCCACTTGACCTAC 

CTCAGATCATTCAACGACrCCATGGCCTTAGCGCATTITCACTCCATAGTTACTC 
TCCAGGTGAGATCAATAGGGTGGCT 

TCATGCCTCAGGAAACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCC 
AGAAGTGTCCGCGCTAGGCTACTGTC 

CCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACTGGGCAGTAA 
GGACCAAGCTCAAACTCACTCCAATCC 

CGGCTGCGTCCCAGTTGGATTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGG 
GAGACATATATCACAGCCTGTCTCGT 

GCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGTAGGCA 

TCTATCTACTCCCCAACCGATGAAC 

GGGG ACCT AAAC ACTCCAGGCCA AT AGGCC ATCCTGTTTTTTTCCC1" 1" 1 TTTTTI 

TTCimTi'iini i lTiTrrii ii 

TTTTTTTTTI 1 rCTCXJITTTTTTTTCCTCTTTTTTTCCT^ 
CCATCTTAGCCCTAGTCACGGC 
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TAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCT 
CTCTGCAGATCAAGT 

BB7-F1/F2: [SEQ ID NO:3] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTITGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCITCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 
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GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTAcACCAATGTGGACCAAGACCTTGTGGG 
CTGGCCCGCTCCTCAAGGTTCCCGCT 

CATTGACACCCTGTACCTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 
CCGATGTCATTCCCGTGCGCCGGCGA 

GGTGATAGCAGGGGTAGCCTGCrTTCGCCCCGGCCCATTTCCTACTTGAAAGGC 
TCCTCGGGGGGTCCGCTGTTGTGCCC 

CGCGGGACACGCCGTGGGCCTATTCAGGGCCGCGGTGTGCACCCGTGGAGTGG 
CTAAAGCGGTGGACTTTATCCCTGTGG 

AGAACCTAGGGACAACCATGAGATCCCCGGTGTTCACGGACAACTCCTCTCCAC 
CAGCAGTGCCCCAGAGCTTCCAGGTG 

GCCCACCTGCATGCTCCCACCGGCAGCGGTAAGAGCACCAAGGTCCCGGCTGC 
GTACGCAGCCCAGGGCTACAAGGTGTT 

GGTGCTCAACCCCTCTGTTGCTGCAACGCTGGGCTTTGGTGCTTACATGTCCAAG 
GCCCATGGGGTTGATCCTAATATCA 

GGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCTACG 
GCAAGTTCCTTGCCGACGGCGGGTGC 

TCAGGAGGTGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCC 
ACATCCATCTTGGGCATCGGCACTGT 

CCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACTGCTAC 
CCCTCCGGGCTCCGTCACTGTGTCCC 

ATCCTAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCCTTTTACG 
GCAAGGCTATCCCCCTCGAGGTGATC 

AAGGGGGGAAGACATCTCATCTTCTGCCACTCAAAGAAGAAGTGCGACGAGCT 
CGCCGCGAAGCTGGTCGCATTGGGCAT 

CAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCTGTCATCCCGACCAGCGG 
CGATGTTGTCGTCGTGTCGACCGATG 

CTCTCATGACTGGCTTTACCGGCGACTTCGACTCTGTGATAGACTGCAACACGT 
GTGTCACTCAGACAGTCGATTTCAGC 

CTTGACCCTACCnTACCATTGAGACAACCACGCTCCCCCAGGATGCTGTCTCCA 
GGACTCAACGCCGGGGCAGGACTGG 
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CAGGGGGAAGCCAGGCATCTATAGATTTGTGGCACCGGGGGAGCGCCCCTCCG 
GCATGTTCGACTCGTCCGTCCTCTGTG 

AGTGCTATGACGCGGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAGACTACAG 
TTAGGCTACGAGCGTACATGAACACC 

CCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACG 
GGCCTCACTCATATAGATGCCCACTT 

TTTATCCCAGACAAAGCAGAGTGGGGAGAACTTTCCTTACCTGGTAGCGTACCA 
AGCCACCGTGTGCGCTAGGGCTCAAG 

CCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATCCGCCTTAAACCCA 
CCCTCCATGGGCCAACACCCCTGCTA 

TACAGACTGGGCGCTGTTCAGAATGAAGTCACCCTGACGCACCCAATCACCAAA 
TACATCATGACATGCATGTCGGCCGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTCT 
GGCCGCGTATTGCCTGTCAACAGGCT 

GCGTGGTCATAGTGGGCAGGATCGTCTTGTCCGGGAAGCCGGCAATTATACCTG 
ACAGGGAGGTTCTCTACCAGGAGTTC 

GATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGAT 
GCTCGCTGAGCAGTTCAAGCAGAAGGC 

CCTCGGCCTCCTGCAGACCGCGTCCCGCCATGCAGAGGTTATCACCCCTGCTGT 
CCAGACCAACTGGCAGAAACTCGAGG 

TCTTTTGGGCGAAGCACATGTGGAATTTCATCAGTGGGATACAATACTTGGCGG 
GCCTGTCAACGCTGCCTGGTAACCCC 

GCCATTGCTTCATTGATGGCTTTTACAGCTGCCGTCACCAGCCCACTAACCACTG 
GCCAAACCCTCCTCTTCAACATATT 

GGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGT 
GGGTGCTGGCCTAGCTGGCGCCGCCA 

TCGGCAGCGTTGGACTGGGGAAGGTCCTCGTGGACATTCTTGCAGGGTATGGCG 
CGGGCGTGGCGGGAGCTCTTGTAGCA 

TrCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTGCTG 
CCCGCCATCCTCTCGCCTGGAGCCCT 

TGTAGTCGGTGTGGTCTGCGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGA 
GGGGGCAGTGCAATGGATGAACCGGC 

TAATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGCCGG 
AGAGCGATGCAGCCGCCCGCGTCACT 

GCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCATCAGTGG 
ATAAGCTCGGAGTGTACCACTCCATG 

CTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGCTGAGCGA 
CTTTAAGACCTGGCTGAAAGCCAAGC 

TCATGCCACAACTGCCTGGGATTCCCTTTGTGTCCTGCCAGCGCGGGTATAGGG 
GGGTCTGGCGAGGAGACGGCATTATG 

CACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGAC 
GATGAGGATCGTCGGTCCTAGGACCTG 
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CAGGAACATGTGGAGTGGGACGTTCCCCATTAACGCCTACACCACGGGCCCCTG 
TACTCCCCTTCCTGCGCCGAACTATA 

AGTTCGCGCTGTGGAGGGTGTCTGCAGAGGAATACGTGGAGATAAGGCGGGTG 
GGGGACTTCCACTACGTATCGGGTATG 

ACTACTGACAATCTTAAATGCCCGTGCCAGATCCCATCGCCCGAATTcTTCACAG 
AATTGGACGGGGTGCGCCTACACAG 

GTTTGCGCCCCCTTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGG 
ACTCCACGAGTACCCGGTGGGGTCGC 

AATTACCTTGCGAGCCCGAACCGGACGTAGCCGTGTTGACGTCCATGCTCACTG 
ATCCCTCCCATATAACAGCAGAGGCG 

GCCGGGAGAAGGTTGGCGAGAGGGTCACCCCCTTCTATGGCCAGCTCCTCGGCT 
AtCCAGCTGTCCGCTCCATCTCTCAA 

GGCAACTTGCACCGCCAACCATGACTCCCCTGACGCCGAGCTCATAGAGGCTAA 
CCTCCTGTGGAGGCAGGAGATGGGCG 

GCAACATCACCAGGGTTGAGTCAGAGAACAAAGTGGTGATTCTGGACTCCTTCG 
ATCCGCTTGTGGCAGAGGAGGATGAG 

CGGGAGGTCTCCGTACCTGCAGAAATTCTGCGGAAGTCTCGGAGATTCGCCCGG 
GCCCTGCCCGTCTGGGCGCGGCCGGA 

CTACAACCCCCCGCTAGTAGAGACGTGGAAAAAGCCTGACTACGAACCACCTGT 
GGTCCATGGCTGCCCGCTACCACCTC 

CACGGTCCCCTCCTGTGCCTCCGCCTCGGAAAAAGCGTACGGTGGTCCTCACCG 
AATCAACCCTATCTACTGCCTTGGCC 

GAGCTTGCCACCAAAAGTTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGAC 
AATACGACAACATCCTCTGAGCCCGC 

CCCTTCTGGCTGCCCCCCCGACTCCGACGTTGAGTCCTATTCTTCCATGCCCCCC 
CTGGAGGGGGAGCCTGGGGATCCGG 

ATCTCAGCGACGGGTCATGGTCGACGGTCAGTAGTGGGGCCGACACGGAAGAT 
GTCGTGTGCTGCTCAATGTCTTATTCC 

TGGACAGGCGCACTCGTCACCCCGTGCGCTGCGGAAGAACAAAAACTGCCCAT 
CAACGCACTGAGCAACTCGTTGCTACG 

CCATCACAATCTGGTGTATTCCACCACTTCACGCAGTGCTTGCCAAAGGCAGAA 
GAAAGTCACATTTGACAGACTGCAAG 

TTCTGGACAGCCATTACCAGGACGTGCTCAAGGAGGTCAAAGCAGCGGCGTCA 
AAAGTGAAGGCTAACTTGCTATCCGTA 

GAGGAAGCTTGCAGCCTGACGCCCCCACATTCAGCCAAATCCAAGTTTGGCTAT 
GGGGCAAAAGACGTCCGTTGCCATGC 

CAGAAAGGCCGTAGCCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACA 
GTGTAACACCAATTGACACCACCATCA 

TGGCAAAAAATGAGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCA 
GCTCGCCTTATCGTATTCCCAGATTTG 

GGGGTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTCTCCACCCTCCCT 
CAGGCCGTGATGGGCTCTTCATACGG 
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ATTCCAATACTCTCCTGGACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGC 
GAAGAAATGCCCTATGGGCTTCGCAT 

ATGACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGTGTTGAGG 
AGTCAATCTACCAATGTTGTGACTTG 

GCCCCCGAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGG 
GGGCCCCCTGACTAATTCTAAAGGGCA 

GAACTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTGCG 
GTAATACCCTCACATGTTACTTGAAGG 

CCGCTGCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACGATGCTCGTATGCG 
GAGACGACCTTGTCGTTATCTGTGAA 

AGCGCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTAT 
GACTAGATACTCTGCCCCCCCTGGGGA 

CCCGCCCAAACCAGAATACGACTTGGAGTTGATAACATCATGCTCCTCCAATGT 
GTCAGTCGCGCACGATGCATCTGGCA 

AAAGGGTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGT 
GGGAGACAGCTAGACACACTCCAGTC 

AATTCCTGGCTAGGCAACATCATCATGTATGCGCCCACCTTGTGGGCAAGGATG 
ATCCTGATGACTCATTTCTTCTCCAT 

CCTTCTAGCTCAGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGC 
CTGTTACTCCATTGAGCCACTTGACC 

TACCTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTTCACTCCATAGTTA 
CTCTCCAGGTGAGATCAATAGGGTG 

GCTTCATGCCTCAGGAAACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGG 
GCCAGAAGTGTCCGCGCTAGGCTACT 

GTCCCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACTGGGCAGT 
AAGGACCAAGCTCAAACTCACTCCAA 

TCCCGGCTGCGTCCCAGTTGGATTTATCCAGCTGGTTCGTTGCTGGTTACAGCGG 
GGGAGACATATATCACAGCCTGTCT 

CGTGCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGTAG 
GCATCTATCTACTCCCCAACCGATG 

AACGGGGACCTAAACACTCCAGGCCAATAGGCCATCCTGTTITTTTCCCTTTTTT 

TTTTTCITTTTTTTTTTTTTTT^ 

TTTTTTTTTTTTTTTCTCCIT^ 

CTCCATCTTAGCCCTAGTCAC 

GGCTAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGG 
CCTCTCTGCAGATCAAGT 

BB7-F3: [SEQ ID NO:4] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 
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GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCH 1 i 1GTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGGATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GnTCCCTCTAGCGGGATCAATrCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTrGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 



37 



WO 03/085084 




PCT/US03/10177 



TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCG 
GCTGGCAAGCGCCCCCCGGGGCGCGTT 

CCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATG 
CCGATGTCATTCCGGTGCGCCGGCGG 

GGCGACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGC 
TCTTCGGGCGGTCCACTGCTCTGCCC 

CTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGC 
GAAGGCGGTGGACTTTGTACCCGTCG 

AGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTC 
CGGCCGTACCGCAGACATTCCAGGTG 

GCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCG 
TATGCAGCCCAAGGGTATAAGGTGCT 

TGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAA 
GGCACATGGTATCGACCCTAACATCA 

GAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCACCTATG 
GCAAGTTTCTTGCCGACGGTGGTTGC 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACCACTATCCTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTA 
CGCCTCCGGGATCGGTCACCGTGCCAC 

ATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTTTATG 
GCAAAGCCATCCCCATCGAGACCATC 

AAGGGGGGGAGGCACCTCATTTTCTGCCATTCCAAGAAGAAATGTGATGAGCTC 
GCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGG 
AGACGTCATTGTCGTAGCAACGGACG 

CTCTAATGACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

CTGGACCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCA 
CGCTCGCAGCGGCGAGGCAGGACTGG 

TAGGGGCAGGATGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCCTCGGTTCTGTGCG 

AGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAG 
TTAGGTTGCGGGCTTACCTAAACACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACA 
GGCCTCACCCACATAGACGCCCATTT 

CTTGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCA 
GGCTACGGTGTGCGCCAGGGCTCAGG 
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CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTAAAGCCTA 
CGCTGCACGGGCCAACGCCCCTGCTG 

TATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACCAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCTCT 
GGCCGCGTATTGCCTGACAACAGGCA 

GCGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCG 
ACAGGGAAGTCCTTTACCGGGAGTTC 

GATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGCA 
GCTCGCCGAACAATTCAAACAGAAGGC 

AATCGGGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAATCCAAGTGGCGGACCCTCGAAG 

CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAG 
GCTTGTCCACTCTGCCTGGCAACCCC 

GCGATAGCATCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACC 
CAACATACCCTCCTGTTTAACATCCT 

GGGGGGATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCITCTGCTTTCGT 
AGGCGCCGGCATCGCTGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAG 
CAGGGGTGGCAGGCGCGCTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTC 
CCTGCTATCCTCTCCCCTGGCGCCCT 

AGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCCCAGGGG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGTGG 
ATCAACGAGGACTGCTCCACGCCATG 

CTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
TTTCAAGACCTGGCTCCAGTCCAAGC 

TCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTACAAGG 
GAGTCTGGCGGGGCGACGGCATCATG 

CAAACCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTC 
CATGAGGATCGTGGGGCCTAGGACCTG 

TAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCTG 
CACGCCCTCCCCGGCGCCAAATTATT 

CTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACA 
GAAGTGGATGGGGTGCGGTTGCACAG 

GTACGCTCCAGCGTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGG 
GCTCAATCAATACCTGGTTGGGTCAC 
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AGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCG 
ACCCCTCCCACATTACGGCGGAGACG 

GCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCT 
ATCCAGCTGTCTGCGCCTTCCTTGAA 

GGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCCTGTGGCGGCAGGAGATGGGCG 

GGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGACTCTTTCG 
AGCCGCTCCAAGCGGAGGAGGATGAG 

AGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGA 
GCGATGCCCATATGGGCACGCCCGGA 

TTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCAGT 
GGTACACGGGTGTCCATTGCCGCCTG 

CCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAG 
AATCTACCGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACAGCGG 
CACGGCAACGGCCTCTCCTGACCAGCC 

CTCCGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAGGGGGAGCCGGGGGATCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGTCG 
TCTGCTGCTCGATGTCCTACACATGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAACCAAGCTGCCCATCAA 
TGCACTGAGCAACTCTTTGCTCCGTCA 

CCACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAA 
GGTCACCTTTGACAGACTGCAGGTCC 

TGGACGACCACTACCGGGACGTGCTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAACTTCTATCCGTGGAG 

GAAGCCTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGGACTTGCTGGAAGACACTGA 
GACACCAATtGACACTACCATCATGG 

CCAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTC 
GTCTCATCGTGTTCCCCGACCTGGGC 

GTGCGCGTGTGCGAGAAGATGGCCCTGTACGACGTGGTTAGCAAGCTCCCCCTG 
GCCGTGATGGGAAGCTCCTACGGATT 

CCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAA 
GAAGACCCCGATGGGGTTCTCGTATG 

ATACCCGCTGTTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGG 
CAATTTACCAATGTTGTGACCTGGAC 

CCCCAAGCCCGCGTGGCCATCAAGTCCCTCACTGAGAGGCTTTATGTTGGGGGC 
CCTCTTACCAATTCAAGGGGGGAAAA 

CTGCGGCTACCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAA 
CACCCTCACTTGCTACATCAAGGCCC 
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GGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCG 
ACGACTTAGTCGTTATCTGTGAAAGT 

GCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGAC 
CAGGTACTCCGCCCCCCCCGGGGACCC 

CCCACAACCAGAATACGACTTGGAGCTTATAACATCATGCTCCTCCAACGTGTC 
AGTCGCCCACGACGGCGCTGGAAAGA 

GGGTCTACTACCTTACCCGTGACCCTACAACCCCCCTCGCGAGAGCCGCGTGGG 
AGACAGCAAGACACACTCCAGTCAAT 

TCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATA 
CTGATGACCCATTTCTTTAGCGTCCT 

CATAGCCAGGGATCAGCTTGAACAGGCTCTTAACTGTGAGATCTACGGAGCCTG 
CTACTCCATAGAACCACTGGATCTAC 

CTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTrTCACTCCACAGTTACTC 
TCCAGGTGAAATCAATAGGGTGGCC 

GCATGCCTCAGAAAACTTGGGGTCCCGCCCTTGCGAGCTTGGAGACACCGGGCC 
CGGAGCGTCCGCGCTAGGCTTCTGTC 

CAGAGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAA 
GAACAAAGCTCAAACTCACTCCAATAG 

CGGCCGCTGGCCGGCTGGACTTGTCCGGTTGGTTCACGGCTGGCTACAGCGGGG 
GAGACATTTATCACAGCGTGTCTCAT 

GCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGCTGCAGGGGTAGGCA 
TCTACCTCCTCCCCAACCGATGAAG 

GTrGGGGTAAACACTCCGGCCTCTTAAGCCATTTCCTG l 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 
TTTTTTn 1 rCTTTTTTTTTTTC T 

TTCCTITCCTTC llUUlU^ C ClllCliril CCCTTCTrTAATGGTGGCTCCATCTrA 
GCCCTAGTCACGGCTAGCTGTGA 

AAGGTCCGTGAGCCGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGA 
TCAaGT 

BB7-F3(C): [SEQ ID NO:5] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 



41 



WO 03/085084 




PCT/US03/10177 



CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCl'l'l'l'rGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCITCTATCGCCITCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATITTCCACCATATrGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCG 
GCTGGCAAGCGCCCCCCGGGGCGCGTT 

CCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATG 
CCGATGTCATTCCGGTGCGCCGGCGG 
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GGCGACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGC 
TCTTCGGGCGGTCCACTGCTCTGCCC 

CTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGC 
GAAGGCGGTGGACTTTGTACCCGTCG 

AGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTC 
CGGCCGTACCGCAGACATTCCAGGTG 

GCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCG 
TATGCAGCCCAAGGGTATAAGGTGCT 

TGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAA 
GGCACATGGTATCGACCCTAACATCA 

GAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCACCTATG 
GCAAGTTTCTTGCCGACGGTGGTTGC 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACCACTATCCTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTA 
CGCCTCCGGGATCGGTCACCGTGCCAC 

ATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTnTATG 
GCAAAGCCATCCCCATCGAGACCATC 

AAGGGGGGGAGGCACCTCATTTTCTGCCATTCCAAGAAGAAATGTGATGAGCTC 
GCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGG 
AGACGTCATTGTCGTAGCAACGGACG 

CTCTAATGACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

CTGGACCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCA 
CGCTCGCAGCGGCGAGGCAGGACTGG 

TAGGGGCAGGATGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCCTCGGTTCTGTGCG 

AGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAG 
TTAGGTTGCGGGCTTACCTAAACACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACA 
GGCCTCACCCACATAGACGCCCATTT 

CTTGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCA 
GGCTACGGTGTGCGCCAGGGCTCAGG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTAAAGCCTA 
CGCTGCACGGGCCAACGCCCCTGCTG 

TATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACCAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCTCT 
GGCCGCGTATTGCCTGACAACAGGCA 

GCGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCG 
ACAGGGAAGTCCTTTACCGGGAGTTC 
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GATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGCA 
GCTCGCCGAACAATTCAAACAGAAGGC 

AATCGGGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAATCCAAGTGGCGGACCCTCGAAG 

CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAG 
GCTTGTCCACTCTGCCTGGCAACCCC 

GCGATAGCATCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACC 
CAACATACCCTCCTGTTTAACATCCT 

GGGGGGATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGT 
AGGCGCCGGCATCGCTGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAG 
CAGGGGTGGCAGGCGCGCTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTC 
CCTGCTATCCTCTCCCCTGGCGCCCT 

AGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCCCAGGGG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGTGG 
ATCAACGAGGACTGCTCCACGCCATG 

CTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
TTTCAAGACCTGGCTCCAGTCCAAGC 

TCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTACAAGG 
GAGTCTGGCGGGGCGACGGCATCATG 

CAAACCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTC 
CATGAGGATCGTGGGGCCTAGGACCTG 

TAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCTG 
CACGCCCTCCCCGGCGCCAAATTATT 

CTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACA 
GAAGTGGATGGGGTGCGGTTGCACAG 

GTACGCTCCAGCGTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGG 
GCTCAATCAATACCTGGTTGGGTCAC 

AGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCG 
ACCCCTCCCACATTACGGCGGAGACG 

GCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCT 
ATCCAGCTGTCTGCGCCTTCCTTGAA 

GGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCCTGTGGCGGCAGGAGATGGGCG 

GGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGACTCTTTCG 
AGCCGCTCCAAGCGGAGGAGGATGAG 
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AGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGA 
GCGATGCCCATATGGGCACGCCCGGA 

TTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCAGT 
GGTACACGGGTGTCCATTGCCGCCTG 

CCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAG 
AATCTACCGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACAGCGG 
CACGGCAACGGCCTCTCCTGACCAGCC 

CTCCGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAGGGGGAGCCGGGGGATCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGTCG 
TCTGCTGCTCGATGTCCTACACATGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAACCAAGCTGCCCATCAA 
TGCACTGAGCAACTCTTTGCTCCGTCA 

CCACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAA 
GGTCACCTTTGACAGACTGCAGGTCC 

TGGACGACCACTACCGGGACGTGCTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAACTTCTATCCGTGGAG 

GAAGCCTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGGACTTGCTGGAAGACACTGA 
GACACCAATtGACACTACCATCATGG 

CCAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTC 
GTCTCATCGTGTTCCCCGACCTGGGC 

GTGCGCGTGTGCGAGAAGATGGCCCTGTACGACGTGGTTAGCAAGCTCCCCCTG 
GCCGTGATGGGAAGCTCCTACGGATT 

CCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAA 
GAAGACCCCGATGGGGTTCTCGTATG 

ATACCCGCTGTTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGG 
CAATTTACCAATGTTGTGACCTGGAC 

CCCCAAGCCCGCGTGGCCATCAAGTCCCTCACTGAGAGGCTTTATGTTGGGGGC 
CCTCTTACCAATTCAAGGGGGGAAAA 

CTGCGGCTACCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAA 
CACCCTCACTTGCTACATCAAGGCCC 

GGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCG 
ACGACTTAGTCGTTATCTGTGAAAGT 

GCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGAC 
CAGGTACTCCGCCCCCCCCGGGGACCC 

CCCACAACCAGAATACGACTTGGAGCTTATAACATCATGCTCCTCCAACGTGTC 
AGTCGCCCACGACGGCGCTGGAAAGA 

GGGTCTACTACCTTACCCGTGACCCTACAACCCCCCTCGCGAGAGCCGCGTGGG 
AGACAGCAAGACACACTCCAGTCAAT 
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TCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATA 
CTGATGACCCATTTCTTTAGCGTCCT 

CATAGCCAGGGATCAGCTTGAACAGGCTCTTAACTGTGAGATCTACGGAGCCTG 
CTACTCCATAGAACCACTGGATCTAC 

CTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACTCCACAGTTACTC 
TCCAGGTGAAATCAATAGGGTGGCC 

GCATGCCTCAGAAAACTTGGGGTCCCGCCCTTGCGAGCTTGGAGACACCGGGCC 
CGGAGCGTCCGCGCTAGGCTTCTGTC 

CAGAGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAA 
GAACAAAGCTCAAACTCACTCCAATAG 

CGGCCGCTGGCCGGCTGGACTTGTCCGGTTGGTTCACGGCTGGCTACAGCGGGG 
GAGACATTTATCACAGCGTGTCTCAT 

GCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGCTGCAGGGGTAGGCA 
TCTACCTCCTCCCCAACCGATGAAG 

GTTGGGGTAAACACTCCGGCCTCTTAAGCCATTTCCTG'l i ri l l l il l 1 1 1 1 1 1 1 1 
TTTTTTTTTTCTTTTTTTTTTTCT 

TTCCTTTCCTTC llUliriCCIllClUiUr CCCrTCTTTAATGGTGGCTCCATCTrA 
GCCCTAGTCACGGCTAGCTGTGA 

AAGGTCCGTGAGCCGCtTGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGA 
TCAaGT 

BB7/H77NS5B: [SEQ ID NO:6] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCITTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 
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CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GnTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGITA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCITGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCG 
GCTGGCAAGCGCCCCCCGGGGCGCGTT 

CCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATG 
CCGATGTCATTCCGGTGCGCCGGCGG 

GGCGACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGC 
TCTTCGGGCGGTCCACTGCTCTGCCC 

CTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGC 
GAAGGCGGTGGACTTTGTACCCGTCG 

AGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTC 
CGGCCGTACCGCAGACATTCCAGGTG 

GCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCG 
TATGCAGCCCAAGGGTATAAGGTGCT 
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TGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAA 
GGCACATGGTATCGACCCTAACATCA 

GAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCACCTATG 
GCAAGTTTCTTGCCGACGGTGGTTGC 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACCACTATCCTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTA 
CGCCTCCGGGATCGGTCACCGTGCCAC 

ATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTTTATG 
GCAAAGCCATCCCCATCGAGACCATC 

AAGGGGGGGAGGCACCTCATTTTCTGCCATTCCAAGAAGAAATGTGATGAGCTC 
GCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGG 
AGACGTCATTGTCGTAGCAACGGACG 

CTCTAATGACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

CTGGACCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCA 
CGCTCGCAGCGGCGAGGCAGGACTGG 

TAGGGGCAGGATGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCCTCGGTTCTGTGCG 

AGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAG 
TTAGGTTGCGGGCTTACCTAAACACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACA 
GGCCTCACCCACATAGACGCCCATTT 

CTTGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCA 
GGCTACGGTGTGCGCCAGGGCTCAGG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTAAAGCCTA 
CGCTGCACGGGCCAACGCCCCTGCTG 

TATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACCAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCTCT 
GGCCGCGTATTGCCTGACAACAGGCA 

GCGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCG 
ACAGGGAAGTCCTTTACCGGGAGTTC 

GATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGCA 
GCTCGCCGAACAATTCAAACAGAAGGC 

AATCGGGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAATCCAAGTGGCGGACCCTCGAAG 

CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAG 
GCTTGTCCACTCTGCCTGGCAACCCC 

GCGATAGCATCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACC 
CAACATACCCTCCTGTTTAACATCCT 
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GGGGGGATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGT 
AGGCGCCGGCATCGCTGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAG 
CAGGGGTGGCAGGCGCGCTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTC 
CCTGCTATCCTCTCCCCTGGCGCCCT 

AGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCCCAGGGG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGTGG 
ATCAACGAGGACTGCTCCACGCCATG 

CTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
TTTCAAGACCTGGCTCCAGTCCAAGC 

TCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTACAAGG 
GAGTCTGGCGGGGCGACGGCATCATG 

CAAACCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTC 
CATGAGGATCGTGGGGCCTAGGACCTG 

TAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCTG 
CACGCCCTCCCCGGCGCCAAATTATT 

CTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACA 
GAAGTGGATGGGGTGCGGTTGCACAG 

GTACGCTCCAGCGTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGG 
GCTCAATCAATACCTGGTTGGGTCAC 

AGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCG 
ACCCCTCCCACATTACGGCGGAGACG 

GCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCT 
ATCCAGCTGTCTGCGCCTTCCTTGAA 

GGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCCTGTGGCGGCAGGAGATGGGCG 

GGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGACTCTTTCG 
AGCCGCTCCAAGCGGAGGAGGATGAG 

AGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGA 
GCGATGCCCATATGGGCACGCCCGGA 

TTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCAGT 
GGTACACGGGTGTCCATTGCCGCCTG 

CCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAG 
AATCTACCGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACAGCGG 
CACGGCAACGGCCTCTCCTGACCAGCC 
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CTCCGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAGGGGGAGCCGGGGGATCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGTCG 
TCTGCTGCTCGATGTCCTACACATGG 

ACAGGCGCCCTGATCACCCCGTGCGCTGCGGAAGAACAAAAACTGCCCATCAA 
CGCACTGAGCAACTCGTTGCTACGCCA 

TCACAATCTGGTGTATTCCACCACTTCACGCAGTGCTTGCCAAAGGCAGAAGAA 
AGTCACATTTGACAGACTGCAAGTTC 

TGGACAGCCATTACCAGGACGTGCTCAAGGAGGTCAAAGCAGCGGCGTCAAAA 
GTGAAGGCTAACTTGCTATCCGTAGAG 

GAAGCTTGCAGCCTGACGCCCCCACATTCAGCCAAATCCAAGTTTGGCTATGGG 
GCAAAAGACGTCCGTTGCCATGCCAG 

AAAGGCCGTAGCCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACAGTGT 
AACACCAATAGACACTACCATCATGG 

CCAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTC 
GTCTCATCGTGTTCCCCGACCTGGGC 

GTGCGCGTGTGCGAGAAGATGGCCCTGTACGACGTGGTTAGCAAGCTCCCCCTG 
GCCGTGATGGGAAGCTCCTACGGATT 

CCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAA 
GAAGACCCCGATGGGGTTCTCGTATG 

ATACCCGCTGTTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGG 
CAATTTACCAATGTTGTGACCTGGAC 

CCCCAAGCCCGCGTGGCCATCAAGTCCCTCACTGAGAGGCTTTATGTTGGGGGC 
CCTCTTACCAATTCAAGGGGGGAAAA 

CTGCGGCTACCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAA 
CACCCTCACTTGCTACATCAAGGCCC 

GGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCG 
ACGACTTAGTCGTTATCTGTGAAAGT 

GCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGAC 
CAGGTACTCCtCCCCCCCCGGGGACCC 

CCCACAACCAGAATACGACTTGGAGCTTATAACATCATGCTCCTCCAACGTGTC 
AGTCGCCCACGACGGCGCTGGAAAGA 

GGGTCTACTACCTTACCCGTGACCCTACAACCCCCCTCGCGAGAGCCGCGTGGG 
AGACAGCAAGACACACTCCAGTCAAT 

TCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATA 
CTGATGACCCATTTCTTTAGCGTCCT 

CATAGCCAGGGATCAGCTTGAACAGGCTCTTAACTGTGAGATCTACGGAGCCTG 
CTACTCCATAGAACCACTGGATCTAC 

CTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACTCCACAGTTACTC 
TCCAGGTGAAATCAATAGGGTGGCC 

GCATGCCTCAGAAAACTTGGGGTCCCGCCCTTGCGAGCTTGGAGACACCGGGCC 
CGGAGCGTCCGCGCTAGGCTTCTGTC 
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CAGAGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAA 
GAACAAAGCTCAAACTCACTCCAATAG 

CGGCCGCTGGCCGGCTGGACTTGTCCGGTTGGTTCACGGCTGGCTACAGCGGGG 
GAGACATTTATCACAGCGTGTCTCAT 

GCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGCTGCAGGGGTAGGCA 
TCTACCTCCTCCCCAACCGATGAAG 

GTrtacGTAAACACTCCAGGCCAATAGGCCATCCTGl l ri l l lCCCl 1 1 1 1 1 1 1 n i 

CTTTTTTTTTTTTTTTTTTI 1 1 1 

TTTTTTTTTTTTCTCCTTTTTTITTCCT 

CCATCTTAGCCCTAGTCACGGC 

TAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCT 
CTCTGCAGATCAAGT 

HCV la replicon: [SEQ ID NO:7] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 
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TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCITCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GnTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTAcACCAATGTGGACCAAGACCTTGTGGG 
CTGGCCCGCTCCTCAAGGTTCCCGCT 

CATTGACACCCTGTACCTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 
CCGATGTCATTCCCGTGCGCCGGCGA 

GGTGATAGCAGGGGTAGCCTGCTTTCGCCCCGGCCCATTTCCTACTTGAAAGGC 
TCCTCGGGGGGTCCGCTGTTGTGCCC 

CGCGGGACACGCCGTGGGCCTATTCAGGGCCGCGGTGTGCACCCGTGGAGTGG 
CTAAAGCGGTGGACTTTATCCCTGTGG 

AGAACCTAGGGACAACCATGAGATCCCCGGTGTTCACGGACAACTCCTCTCCAC 
CAGCAGTGCCCCAGAGCTTCCAGGTG 

GCCCACCTGCATGCTCCCACCGGCAGCGGTAAGAGCACCAAGGTCCCGGCTGC 
GTACGCAGCCCAGGGCTACAAGGTGTT 

GGTGCTCAACCCCTCTGTTGCTGCAACGCTGGGCTTTGGTGCTTACATGTCCAAG 
GCCCATGGGGTTGATCCTAATATCA 

GGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCTACG 
GCAAGTTCCTTGCCGACGGCGGGTGC 

TCAGGAGGTGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCC 
ACATCCATCTTGGGCATCGGCACTGT 

CCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACTGCTAC 
CCCTCCGGGCTCCGTCACTGTGcCCC 
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ATCCTAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCCTTTTACG 
GCAAGGCTATCCCCCTCGAGGTGATC 

AAGGGGGGAAGACATCTCATCTTCTGCCACTCAAAGAAGAAGTGCGACGAGCT 
CGCCGCGAAGCTGGTCGCATTGGGCAT 

CAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCTGTCATCCCGACCAGCGG 
CGATGTTGTCGTCGTGTCGACCGATG 

CTCTCATGACTGGCTTTACCGGCGACTTCGACTCTGTGATAGACTGCAACACGT 
GTGTCACTCAGACAGTCGATTTCAGC 

CTTGACCCTACCTTTACCATTGAGACAACCACGCTCCCCCAGGATGCTGTCTCCA 
GGACTCAACGCCGGGGCAGGACTGG 

CAGGGGGAAGCCAGGCATCTATAGATTTGTGGCACCGGGGGAGCGCCCCTCCG 
GCATGTTCGACTCGTCCGTCCTCTGTG 

AGTGCTATGACGCGGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAGACTACAG 
TTAGGCTACGAGCGTACATGAACACC 

CCGGGGCITCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTrACG 
GGCCTCACTCATATAGATGCCCACTT 

TTTATCCCAGACAAAGCAGAGTGGGGAGAACnTCCTTACCTGGTAGCGTACCA 
AGCCACCGTGTGCGCTAGGGCTCAAG 

CCCCTCCCCCATCGTGGGACCAGATGTGGgAGTGTTTGATCCGCCTTAAACCCAC 
CCTCCATGGGCCAACACCCCTGCTA 

TACAGACTGGGCGCTGTTCAGAATGAAGTCACCCTGACGCACCCAATCACCAAA 
TACATCATGACATGCATGTCGGCCGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTCT 
GGCCGCGTATTGCCTGTCAACAGGCT 

GCGTGGTCATAGTGGGCAGGATCGTCTTGTCCGGGAAGCCGGCAATTATACCTG 
ACAGGGAGGTTCTCTACCAGGAGTTC 

GATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGAT 
GCTCGCTGAGCAGTTCAAGCAGAAGGC 

CCTCGGCCTCCTGCAGACCGCGTCCCGCCATGCAGAGGTTATCACCCCTGCTGT 
CCAGACCAACTGGCAGAAACTCGAGG 

TCTTrTGGGCGAAGCACATGTGGAATTTCATCAGTGGGATACAATACTTGGCGG 
GCCTGTCAACGCTGCCTGGTAACCCC 

GCCATTGC1TCATTGATGGCTTTTACAGCTGCCGTCACCAGCCCACTAACCACTG 
GCCAAACCCTCCTCTTCAACATATT 

GGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGT 
GGGTGCTGGCCTAGCTGGCGCCGCCA 

TCGGCAGCGTTGGACTGGGGAAGGTCCTCGTGGACATTCTTGCAGGGTATGGCG 
CGGGCGTGGCGGGAGCTCTTGTAGCA 

TTCAAGATCATGAGCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTGCTG 
CCCGCCATCCTCTCGCCTGGAGCCCT 

TGTAGTCGGTGTGGTCTGCGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGA 
GGGGGCAGTGCAATGGATGAACCGGC 
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TAATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGCCGG 
AGAGCGATGCAGCCGCCCGCGTCACT 

GCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCATCAGTGG 
ATAAGCTCGGAGTGTACCACTCCATG 

CTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGtGGTGCTGAGCGA 
CTCTAAGACCTGGCTGAAAGCCAAGC 

TCATGCCACAACTGCCTGGGATTCCCTTTGTGTCCTGCCAGCGCGGGTATAGGG 
GGGTCTGGCGAGGAGACGGCATTATG 

CACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGAC 
GATGAGGATCGTCGGTCCTAGGACCTG 

CAGGAACATGTGGAGTGGGACGTTCCCCATTAACGCCTACACCACGGGCCCCTG 
TACTCCCCTTCCTGCGCCGAACTATA 

AGTTCGCGCTGTGGAGGGTGTCTGCAGAGGAATACGTGGAGATAAGGCGGGTG 
GGGGACTTCCACTACGTATCGGGTATG 

ACTACTGACAATCTTAAATGCCCGTGCCAGATCCCATCGCCCGAATTcTTCACAG 
AATTGGACGGGGTGCGCCTACACAG 

GTTTGCGCCCCCTTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGG 
ACTCCACGAGTACCCGGTGGGGTCGC 

AATTACCTTGCGAGCCCGAACCGGACGTAGCCGTGTTGACGTCCATGCTCACTG 
ATCCCTCCCATATAACAGCAGAGGCG 

GCCGGGAGAAGGTTGGCGAGAGGGTCACCCCCTTCTATGGCCAGCTCCTCGGCT 
AtCCAGCTGTCCGCTCCATCTCTCAA 

GGCAACTTGCACCGCCAACCATGACTCCCCTGACGCCGAGCTCATAGAGGCTAA 
CCTCCTGTGGAGGCAGGAGATGGGCG 

GCAACATCACCAGGGTTGAGTCAGAGAACAAAGTGGTGATTCTGGACTCCTTCG 
ATCCGCTTGTGGCAGAGGAGGATGAG 

CGGGAGGTCTCCGTACCTGCAGAAATTCTGCGGAAGTCTCGGAGATTCGCCCGG 
GCCCTGCCCGTCTGGGCGCGGCCGGA 

CTACAACCCCCCGCTAGTAGAGACGTGGAAAAAGCCTGACTACGAACCACCTGT 
GGTCCATGGCTGCCCGCTACCACCTC 

CACGGTCCCCTCCTGTGCCTCCGCCTCGGAAAAAGCGTACGGTGGTCCTCACCG 
AATCAACCCTATCTACTGCCTTGGCC 

GAGCTTGCCACCAAAAGTTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGAC 
AATACGACAACATCCTCTGAGCCCGC 

CCCTTCTGGCTGCCCCCCCGACTCCGACGTTGAGTCCTATTCTTCCATGCCCCCC 
CTGGAGGGGGAGCCTGGGGATCCGG 

ATCTCAGCGACGGGTCATGGTCGACGGTCAGTAGTGGGGCCGACACGGAAGAT 
GTCGTGTGCTGCTCAATGTCTTATTCC 

TGGACAGGCGCACTCGTCACCCCGTGCGCTGCGGAAGAACAAAAACTGCCCAT 
CAACGCACTGAGCAACTCGTTGCTACG 

CCATCACAATCTGGTGTATTCCACCACTTCACGCAGTGCTTGCCAAAGGCAGAA 
GAAAGTCACATTTGACAGACTGCAAG 
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TTCTGGACAGCCATTACCAGGACGTGCTCAAGGAGGTCAAAGCAGCGGCGTCA 
AAAGTGAAGGCTAACTTGCTATCCGTA 

GAGGAAGCTTGCAGCCTGACGCCCCCACATTCAGCCAAATCCAAGTTTGGCTAT 
GGGGCAAAAGACGTCCGTTGCCATGC 

CAGAAAGGCCGTAGCCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACA 
GTGTAACACCAATtGACACTACCATCA 

TGGCCAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCGTAAGCCA 
GCTCGTCTCATCGTGTTCCCCGACCTG 

GGCGTGCGCGTGTGCGAGAAGATGGCCCTGTACGACGTGaTTAGCAAGCTCCCC 
CTGGCCGTGATGGGAAGCTCCTACGG 

ATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTC 
CAAGAAGACCCCGATGGGGTTCTCGT 

ATGATACCCGCTGTTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGG 
AGGCAATTTACCAATGTTGTGACCTG 

GACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACTGAGAGGCTTTATGTTGGG 
GGCCCTCTTACCAATTCAAGGGGGGA 

AAACTGCGGCTACCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTG 
GTAACACCCTCACTTGCTACATCAAGG 

CCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTG 
GaGACGACTTAGTCGTTATCTGTGAA 

AGTGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTAT 
GACCAGGTACTCCGCCCCCCCCGGGGA 

CCCCCCACAACCAGAATACGACTTGGAGCTTATAACATCATGCTCCTCCAACGT 
GTCAGTCGCCCACGACGGCGCTGGAA 

AGAGGGTCTACTACCTTACCCGTGACCCTACAACCCCCCTCGCGAGAGCCGCGT 
GGGAGACAGCAAGACACACTCCAGTC 

AATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATG 
ATACTGATGACCCATTTCTTTAGCGT 

CCTCATAGCCAGGGATCAGCTTGAACAGGCTCTTAACTGTGAGATCTACGGAGC 
CTGCTACTCCATAGAACCACTaGATC 

TACCTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACTCCACAGTTA 
CTCTCCAGGTGAAATCAATAGGGTG 

GCCGCATGCCTCAGAAAACTTGGGGTCCCGCCCTTGCGAGCTTGGAGACACCGG 
GCCCGGAGCGTCCGCGCTAGGCTTCT 

GTCCAGAGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGT 
AAGAACAAAGCTCAAACTCACTCCAA 

TAGCGGCCGCTGGCCGGCTGGACTTGTCCGGTTGGTTCACGGCTGGCTACAGCG 
GGGGAGACATTTATCACAGCGTGTCT 

CATGCCCGGCCCCGCTGGTTCTGGTTTTGCCTACTCCTGCTCGCTGCAGGGGTAG 
GCATCTACCTCCTCCCCAACCGATG 

AAGGTTGGGGTAAACACTCCGGCCTCTTAAGCCATTTCCTGl l l 1 1 11 1 1 i 1 1 1 1 A 
TTTTTTTTTTTTTCTTTTTTTTTT 



55 



WO 03/085084 




PC17US03/10177 



TCTTTCCHTTCCnTCITTTTTTC 
TTAGCCCTAGTCACGGCTAGCTG 

TGAAAGGTCCGTGAGCCGCtTGACTGCAGAGAGTGCTGATACTGGCCTCTCTGC 
AGATCAaGT 

(B) Sequences from HCV J4 Replicons 
J4M/S:[SEQIDNO: 8] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

G1TTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 
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TCITCITGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTAGACCTGGACCTCGTCGG 
CTGGCAGGCGCCCCCCGGGGCGCGCT 

CCATGACACCATGCAGCTGTGGCAGCTCGGACCTTTACTTGGTCACGAGACATG 
CTGATGTCATTCCGGTGCGCCGGCGA 

GGCGACAGCAGGGGAAGTCTACTCTCCCCCAGGCCCGTCTCCTACCTGAAAGGC 
TCCTCGGGTGGTCCATTGCTTTGCCC 

TTCGGGGCACGTCGTGGGCGTCTTCCGGGCTGCTGTGTGCACCCGGGGGGTCGC 
GAAGGCGGTGGACTTCATACCCGTTG 

AGTCTATGGAAACTACCATGCGGTCTCCGGTCTTCACAGACAACTCAACCCCCC 
CGGCTGTACCGCAGACATTCCAAGTG 

GCACATCTGCACGCTCCTACTGGCAGCGGCAAGAGCACCAAAGTGCCGGCTGC 
GTATGCAGCCCAAGGGTACAAGGTGCT 

CGTCCTGAACCCGTCCGTTGCCGCCACCTTAGGGTTTGGGGCGTATATGTCCAA 
GGCACACGGTATCGACCCTAACATCA 

GAACTGGGGTAAGGACCATTACCACGGGCGGCTCCATTACGTACTCCACCTATG 
GCAAGTTCCTTGCCGACGGTGGCTGT 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACTACCATCTTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGGCTCGTCGTGCTCGCCACCGCTA 
CACCTCCGGGATCGGTTACCGTGCCAC 

ACCCCAATATCGAGGAAATAGGCCTGTCCAACAATGGAGAGATCCCCTTCTATG 
GCAAAGCCATCCCCATTGAGGCCATC 

AAGGGGGGGAGGCATCTCATTTTCTGCCATTCCAAGAAGAAATGTGACGAGCTC 
GCCGCAAAGCTGACAGGCCTCGGACT 

GAACGCTGTAGCATATTACCGGGGCCTTGATGTGTCCGTCATACCGCCTATCGG 
AGACGTCGTTGTCGTGGCAACAGACG 
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CTCTAATGACGGGTTTCACCGGCGATTTTGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

TTGGATCCCACCTTCACCATTGAGACGACGACCGTGCCCCAAGACGCGGTGTCG 
CGCTCGCAACGGCGAGGTAGAACTGG 

CAGGGGTAGGAGTGGCATCTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCTTCGGTCCTGTGTG 

AGTGCTATGACGCGGGCTGTGCTTGGTATGAGCTCACGCCCGCTGAGACCTCGG 
TTAGGTTGCGGGCTTACCTAAATACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTCACA 
GGCCTCACCCACATAGATGCCCACTT 

CCTGTCCCAGACTAAACAGGCAGGAGACAACTTTCCTTACCTGGTGGCATATCA 
AGCTACAGTGTGCGCCAGGGCTCAAG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTGAAACCTA 
CACTGCACGGGCCAACACCCCTGCTG 

TATAGGCTAGGAGCCGTCCAAAATGAGGTCATCCTCACACACCCCATAACTAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACTAGCACCTGGGTGCTGGTAGGCGGAGTCCTTGCAGCTTT 
GGCCGCATACTGCCTGACGACAGGCA 

GTGTGGTCATTGTGGGCAGGATCATCTTGTCCGGGAAGCCAGCTGTCGTTCCCG 
ACAGGGAAGTCCTCTACCAGGAGTTC 

GATGAGATGGAAGAGTGTGCCTCACAACTTCCTTACATCGAGCAGGGAATGCA 
GCTCGCCGAGCAATTCAAGCAAAAGGC 

GCTCGGGTTGTTGCAAACGGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAGTCCAAGTGGCGAGCCCTTGAGA 

CCTTCTGGGCGAAGCACATGTGGAATTTCATCAGCGGAATACAGTACCTAGCAG 
GCTTATCCACTCTGCCTGGAAACCCC 

GCGATAGCATCATTGATGGCATTTACAGCTTCTATCACTAGCCCGCTCACCACCC 
AAAACACCCTCCTGTTTAACATCTT 

GGGGGGATGGGTGGCTGCCCAACTCGCTCCTCCCAGCGCTGCGTCAGCTTTCGT 
GGGCGCCGGCATCGCCGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTCGTGGACATCTTGGCGGGCTATGGGG 
CAGGGGTAGCCGGCGCACTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGGTGCCCTCCACCGAGGACCTGGTCAACTTACTC 
CCTGCCATCCTCTCTCCTGGTGCCCT 

GGTCGTCGGGGTCGTGTGCGCAGCAATACTGCGTCGGCACGTGGGCCCGGGAG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCTACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGCCTTACCATCACTCAACTGCTGAAGCGGCTCCACCAGTGG 
ATTAATGAGGACTGCTCTACGCCATG 

CTCCGGCTCGTGGCTAAGGGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
CTTCAAGACCTGGCTCCAGTCCAAAC 
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TCCTGCCGCGGTTACCGGGAGTCCCTTTCCTGTCATGCCAACGCGGGTACAAGG 
GAGTCTGGCGGGGGGACGGCATCATG 

CAAACCACCTGCCCATGCGGAGCACAGATCGCCGGACATGTCAAAAACGGTTC 
CATGAGGATCGTAGGGCCTAGAACCTG 

CAGCAACACGTGGCACGGAACGTTCCCCATCAACGCATACACCACGGGACCTT 
GCACACCCTCCCCGGCGCCCAACTATT 

CCAGGGCGCTATGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGTGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCATGCCAGGTTCCGGCCCCCGAATTCTTCACG 
GAGGTGGATGGAGTGCGGTTGCACAG 

GTACGCTCCGGCGTGCAAACCTCTTCTACGGGAGGACGTCACGTTCCAGGTCGG 
GCTC AACC AAT ACTTGGTC GGGTCGC 

AGCTCCCATGCGAGCCCGAACCGGACGTAACAGTGCTTACTTCCATGCTCACCG 
ATCCCTCCCACATTACAGCAGAGACG 

GCTAAGCGTAGGCTGGCTAGAGGGTCTCCCCCCTCTTTAGCCAGCTCATCAGCT 
AtCCAGTTGTCTGCGCCTTCTTTGAA 

GGCGACATGCACTACCCACCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCTTGTGGCGGCAGGAGATGGGCG 

GAAACATCACTCGCGTGGAGTCAGAGAATAAGGTAGTAATTCTGGACTCTTTCG 
AACCGCTTCACGCGGAGGGGGATGAG 

AGGGAGATATCCGTCGCGGCGGAGATCCTGCGAAAATCCAGGAAGTTCCCCTC 
AGCGTTGCCCATATGGGCACGCCCGGA 

CTACAATCCTCCACTGCTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCGGT 
GGTACACGGATGCCCATTGCCACCTA 

CCAAGGCTCCTCCAATACCACCTCCACGGAGAAAGAGGACGGTTGTCCTGACAG 
AATCCAATGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACTAAGACCTTCGGTAGCTCCGGATCGTCGGCCGTTGATAGCGGC 
ACGGCGACCGCCCTTCCTGACCTGGC 

CTCCGACGACGGTGACAAAGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAAGGGGAGCCGGGGGACCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTGAGTGAGGAGGCTAGTGAGGATGTCG 
TCTGCTGCTCAATGTCCTATACGTGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAAGTAAGCTGCCCATCAA 
CCCGTTGAGCAACTCTTTGCTGCGTCA 

CCACAACATGGTCTACGCCACAACATCCCGCAGCGCAAGCCTCCGGCAGAAGA 
AGGTCACCTTTGACAGATTGCAAGTCC 

TGGATGATCATTACCGGGACGTACTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAGCTTCTATCTATAGAG 

GAGGCCTGCAAGCTGACGCCCCCACATTCGGCCAAATCCAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAGGGCCGTTAACCACATCCGCTCCGTGTGGGAGGACTTGCTGGAAGACACTGA 
AACACCAATTGACACCACCATCATGG 
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CAAAAAATGAGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCT 
CGCCTTATCGTATTCCCAGATTTGGGG 

GTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTCTCCACCCTCCCTCAG 
GCCGTGATGGGCTCTTCATACGGATT 

CCAATACTCTCCTGGACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAA 
GAAATGCCCTATGGGCTTCGCATATG 

ACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGTGTTGAGGAGT 
CAATCTACCAATGTTGTGACTTGGCC 

CCCGAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGGG 
CCCCCTGACTAATTCTAAAGGGCAGAA 

CTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAGGCCG 

CTGCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACGATGCTCGTATGCGGAG 
ACGACCTTGTCGTTATCTGTGAAAGC 

GCGGGGACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGAC 
TAGATACTCTGCCCCCCCTGGGGACCC 

GCCCAAACCAGAATACGACTTGGAGTTGATAACATCATGCTCCTCCAATGTGTC 
AGTCGCGCACGATGCATCTGGCAAAA 

GGGTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGG 
AGACAGCTAGACACACTCCAGTCAAT 

TCCTGGCTAGGCAACATCATCATGTATGCGCCCACCTTGTGGGCAAGGATGATC 
CTGATGACTCATTTCTTCTCCATCCT 

TCTAGCTCAGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTG 
TTACTCCATTGAGCCACTTGACCTAC 

CTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTTCACTCCATAGTTACTC 
TCCAGGTGAGATCAATAGGGTGGCT 

TCATGCCTCAGGAAACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCC 
AGAAGTGTCCGCGCTAGGCTACTGTC 

CCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACTGGGCAGTAA 
GGACCAAGCTCAAACTCACTCCAATCC 

CGGCTGCGTCCCAGTTGGATTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGG 
GAGACATATATCACAGCCTGTCTCGT 

GCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGTAGGCA 
TCTATCTACTCCCCAACCGATGAAC 

GGGGACCTAAACACTCCAGGCCAATAGGCCATCCTGl lTl'l l iCCCi ri i i i n l 
TTCITTTTTTTTTTTTTTTTTTTTT 

ri^uuiiiirri cTC ci^uirriir ccTciuiiiiucci^i'iciii'CCTiTGGTGGCT 

CCATCTTAGCCCTAGTCACGGC 

TAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCT 
CTCTGCAGATCAAGT 
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J4B/Rl:[SEQIDNO:9] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCrGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCrTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 
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TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCG 
GCTGGCAAGCGCCCCCCGGGGCGCGTT 

CCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATG 
CCGATGTCATTCCGGTGCGCCGGCGG 

GGCGACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGC 
TCTTCGGGCGGTCCACTGCTCTGCCC 

CTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGC 
GAAGGCGGTGGACTTTGTACCCGTCG 

AGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTC 
CGGCCGTACCGCAGACATTCCAGGTG 

GCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCG 
TATGCAGCCCAAGGGTATAAGGTGCT 

TGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAA 
GGCACATGGTATCGACCCTAACATCA 

GAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCACCTATG 
GCAAGTTTCTTGCCGACGGTGGTTGC 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACCACTATCCTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTA 
CGCCTCCGGGATCGGTCACCGTGCCAC 

ATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTTTATG 
GCAAAGCCATCCCCATCGAGACCATC 

AAGGGGGGGAGGCACCTCATTTTCTGCCATTCCAAGAAGAAATGTGATGAGCTC 
GCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGG 
AGACGTCATTGTCGTAGCAACGGACG 

CTCTAATGACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

CTGGACCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCA 
CGCTCGCAGCGGCGAGGCAGGACTGG 

TAGGGGCAGGATGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCCTCGGTTCTGTGCG 
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AGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAG 
TTAGGTTGCGGGCTTACCTAAACACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACA 
GGCCTCACCCACATAGACGCCCATTT 

CTTGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCA 
GGCTACGGTGTGCGCCAGGGCTCAGG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTAAAGCCTA 
CGCTGCACGGGCCAACGCCCCTGCTG 

TATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACCAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCTCT 
GGCCGCGTATTGCCTGACAACAGGCA 

GCGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCG 
ACAGGGAAGTCCTTTACCGGGAGTTC 

GATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGCA 
GCTCGCCGAACAATTCAAACAGAAGGC 

AATCGGGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAATCCAAGTGGCGGACCCTCGAAG 

CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAG 
GCTTGTCCACTCTGCCTGGCAACCCC 

GCGATAGCATCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACC 
CAACATACCCTCCTGTTTAACATCCT 

GGGGGGATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGT 
AGGCGCCGGCATCGCTGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAG 
CAGGGGTGGCAGGCGCGCTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTC 
CCTGCTATCCTCTCCCCTGGCGCCCT 

AGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCCCAGGGG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGTGG 
ATCAACGAGGACTGCTCCACGCCATG 

CTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
TTTCAAGACCTGGCTCCAGTCCAAGC 

TCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTACAAGG 
GAGTCTGGCGGGGCGACGGCATCATG 

CAAACCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTC 
CATGAGGATCGTGGGGCCTAGGACCTG 

TAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCTG 
CACGCCCTCCCCGGCGCCAAATTATT 
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CTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACG 
GAGGTGGATGGAGTGCGGTTGCACAG 

GTACGCTCCGGCGTGCAAACCTCTTCTACGGGAGGACGTCACGTTCCAGGTCGG 
GCTCAACCAATACTTGGTCGGGTCGC 

AGCTCCCATGCGAGCCCGAACCGGACGTAACAGTGCTTACTTCCATGCTCACCG 
ATCCCTCCCACATTACAGCAGAGACG 

GCTAAGCGTAGGCTGGCTAGAGGGTCTCCCCCCTCTTTAGCCAGCTCATCAGCT 
AtCCAGTTGTCTGCGCCTTCTTTGAA 

GGCGACATGCACTACCCACCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCTTGTGGCGGCAGGAGATGGGCG 

GAAACATCACTCGCGTGGAGTCAGAGAATAAGGTAGTAATTCTGGACTCTTTCG 
AACCGCTTCACGCGGAGGGGGATGAG 

AGGGAGATATCCGTCGCGGCGGAGATCCTGCGAAAATCCAGGAAGTTCCCCTC 
AGCGTTGCCCATATGGGCACGCCCGGA 

CTACAATCCTCCACTGCTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCGGT 
GGTACACGGATGCCCATTGCCACCTA 

CCAAGGCTCCTCCAATACCACCTCCACGGAGAAAGAGGACGGTTGTCCTGACAG 
AATCCAATGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACTAAGACCTTCGGTAGCTCCGGATCGTCGGCCGTTGATAGCGGC 
ACGGCGACCGCCCTTCCTGACCTGGC 

CTCCGACGACGGTGACAAAGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAAGGGGAGCCGGGGGACCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTGAGTGAGGAGGCTAGTGAGGATGTCG 
TCTGCTGCTCAATGTCCTATACGTGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAAGTAAGCTGCCCATCAA 
CCCGTTGAGCAACTCTTTGCTGCGTCA 

CCACAACATGGTCTACGCCACAACATCCCGCAGCGCAAGCCTCCGGCAGAAGA 
AGGTCACCTTTGACAGATTGCAAGTCC 

TGGATGATCATTACCGGGACGTACTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAGCTTCTATCTATAGAG 

GAGGCCTGCAAGCTGACGCCCCCACATTCGGCCAAATCCAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAGGGCCGTTAACCACATCCGCTCCGTGTGGGAGGACTTGCTGGAAGACACTGA 
AACACCAATTGACACCACCATCATGG 

CAAAAAGTGAGGTTTTCTGCGTCCAACCAGAGAAGGGAGGCCGCAAGCCAGCT 
CGCCTTATCGTATTCCCAGACCTGGGA 

GTTCGTGTATGCGAGAAGATGGCCCTTTACGACGTGGTCTCCACCCTTCCTCAG 
GCCGTGATGGGCTCCTCATACGGATT 

TCAATACTCCCCCAAGCAGCGGGTCGAGTTCCTGGTGAATACCTGGAAATCAAA 
GAAATGCCCTATGGGCTTCTCATATG 
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ACACCCGCTGTTTTGACTCAACGGTCACTGAGAGTGACATTCGTGTTGAGGAGT 
CAATTTACCAATGTTGTGACTTGGCC 

CCCGAGGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGGG 
TCCCCTGACTAACTCAAAAGGGCAGAA 

CTGCGGTTATCGCCGGTGCCGCGCAAGTGGCGTGCTGACGACTAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAGGCCA 

CTGCAGCCTGTCGAGCTGCAAAGCTCCAGGACTGCACGATGCTCGTGAACGGAG 
ACGACCTTGTCGTTATCTGTGAAAGC 

GCGGGAACCCAGGAGGATGCGGCGGCCCTACGAGCCTTCACGGAGGCTATGAC 
TAGGTATTCCGCCCCCCCCGGGGATCC 

GCCCCAACCAGAATACGACCTGGAGCTGATAACATCATGTTCCTCCAATGTGTC 
AGTCGCGCACGATGCATCTGGCAAAA 

GGGTATACTACCTCACCCGTGACCCCACCACCCCCCTTGCACGGGCTGCGTGGG 
AGACAGCTAGACACACTCCAATCAAC 

TCTTGGCTAGGCAATATCATCATGTATGCGCCCACCCTATGGGCAAGGATGATT 
CTGATGACTCACTTTTTCTCCATCCT 

TCTAGCTCAAGAGCAACTTGAAAAAGCCCTGGATTGTCAGATCTACGGGGCTTG 
CTACTCCATTGAGCCACTTGACCTAC 

CTCAGATCATTGAACGACTCCATGGTCTTAGCGCATTTACACTCCACAGTTACTC 
TCCAGGTGAGATCAATAGGGTGGCT 

TCATGCCTCAGGAAACTTGGGGTACCACCCTTGCGAACCTGGAGACATCGGGCC 
AGAAGTGTCCGCGCTAAGCTACTGTC 

CCAGGGGGGGAGGGCCGCCACTTGTGGCAGATACCTCTTTAACTGGGCAGTAA 
GGACCAAGCTTAAACTCACTCCAATCC 

CGGCCGCGTCCCAGCTGGACTTGTCTGGCTGGTTCGTCGCTGGTTACAGCGGGG 
GAGACATATATCACAGCCTGTCTCGT 

GCCCGACCCCGCTGGTTTCCGTTGTGCCTACTCCTACTTTCTGTAGGGGTAGGCA 
TTTACCTGCTCCCCAACCGATGAAC 

GGGGAGCTAACCACTCCAGGCCTTAAGCCATTTCCTGl'rririliri'll'lllliri 
TTTTTTTTTCTTTTTTTTTTTCTT 

TCCTTTCCTTCTTTTTTTCCITTCTTTTTCCCTTCTT^ 
CCCTAGTCACGGCTAGCTGTGAA 

AGGTCCGTGAGCCGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGAT 
CAaGT 

J4B/R1 (C): [SEQ ID NO: 10] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 
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GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 
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ACACAATCnTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCG 
GCTGGCAAGCGCCCCCCGGGGCGCGTT 

CCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATG 
CCGATGTCATTCCGGTGCGCCGGCGG 

GGCGACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGC 
TCTTCGGGCGGTCCACTGCTCTGCCC 

CTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGC 
GAAGGCGGTGGACTTTGTACCCGTCG 

AGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTC 
CGGCCGTACCGCAGACATTCCAGGTG 

GCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCG 
TATGCAGCCCAAGGGTATAAGGTGCT 

TGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAA 
GGCACATGGTATCGACCCTAACATCA 

GAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCACCTATG 
GCAAGTTTCTTGCCGACGGTGGTTGC 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACCACTATCCTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTA 
CGCCTCCGGGATCGGTCACCGTGCCAC 

ATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTTTATG 
GCAAAGCCATCCCCATCGAGACCATC 

AAGGGGGGGAGGCACCTCATTTTCTGCCATTCCAAGAAGAAATGTGATGAGCTC 
GCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGG 
AGACGTCATTGTCGTAGCAACGGACG 

CTCTAATGACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

CTGGACCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCA 
CGCTCGCAGCGGCGAGGCAGGACTGG 

TAGGGGCAGGATGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCCTCGGTTCTGTGCG 

AGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAG 
TTAGGTTGCGGGCTTACCTAAACACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACA 
GGCCTCACCCACATAGACGCCCATTT 

CTTGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCA 
GGCTACGGTGTGCGCCAGGGCTCAGG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTAAAGCCTA 
CGCTGCACGGGCCAACGCCCCTGCTG 
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TATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACCAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCTCT 
GGCCGCGTATTGCCTGACAACAGGCA 

GCGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCG 
ACAGGGAAGTCCTTTACCGGGAGTTC 

GATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGCA 
GCTCGCCGAACAATTCAAACAGAAGGC 

AATCGGGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAATCCAAGTGGCGGACCCTCGAAG 

CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAG 
GCTTGTCCACTCTGCCTGGCAACCCC 

GCGATAGCATCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACC 
CAACATACCCTCCTGTTTAACATCCT 

GGGGGGATGGGTGGCCGCCCAACITGCTCCTCCCAGCGCrGCTTCTGCTTTCGT 
AGGCGCCGGCATCGCTGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAG 
CAGGGGTGGCAGGCGCGCTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTC 
CCTGCTATCCTCTCCCCTGGCGCCCT 

AGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCCCAGGGG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGTGG 
ATCAACGAGGACTGCTCCACGCCATG 

CTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
TTTCAAGACCTGGCTCCAGTCCAAGC 

TCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTACAAGG 
GAGTCTGGCGGGGCGACGGCATCATG 

CAAACCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTC 
CATGAGGATCGTGGGGCCTAGGACCTG 

TAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCTG 
CACGCCCTCCCCGGCGCCAAATTATT 

CTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACG 
GAGGTGGATGGAGTGCGGTTGCACAG 

GTACGCTCCGGCGTGCAAACCTCTTCTACGGGAGGACGTCACGTTCCAGGTCGG 
GCTCAACCAATACTTGGTCGGGTCGC 

AGCTCCCATGCGAGCCCGAACCGGACGTAACAGTGCTTACTTCCATGCTCACCG 
ATCCCTCCCACATTACAGCAGAGACG 
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GCTAAGCGTAGGCTGGCTAGAGGGTCTCCCCCCTCTTTAGCCAGCTCATCAGCT 
AtCCAGTTGTCTGCGCCTTCTTTGAA 

GGCGACATGCACTACCCACCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCTTGTGGCGGCAGGAGATGGGCG 

GAAACATCACTCGCGTGGAGTCAGAGAATAAGGTAGTAATTCTGGACTCTTTCG 
AACCGCTTCACGCGGAGGGGGATGAG 

AGGGAGATATCCGTCGCGGCGGAGATCCTGCGAAAATCCAGGAAGTTCCCCTC 
AGCGTTGCCCATATGGGCACGCCCGGA 

CTACAATCCTCCACTGCTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCGGT 
GGTACACGGATGCCCATTGCCACCTA 

CCAAGGCTCCTCCAATACCACCTCCACGGAGAAAGAGGACGGTTGTCCTGACAG 
AATCCAATGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACTAAGACCTTCGGTAGCTCCGGATCGTCGGCCGTTGATAGCGGC 
ACGGCGACCGCCCTTCCTGACCTGGC 

CTCCGACGACGGTGACAAAGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAAGGGGAGCCGGGGGACCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTGAGTGAGGAGGCTAGTGAGGATGTCG 
TCTGCTGCTCAATGTCCTATACGTGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAAGTAAGCTGCCCATCAA 
CCCGTTGAGCAACTCTTTGCTGCGTCA 

CCACAACATGGTCTACGCCACAACATCCCGCAGCGCAAGCCTCCGGCAGAAGA 
AGGTCACCTTTGACAGATTGCAAGTCC 

TGGATGATCATTACCGGGACGTACTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAGCTTCTATCTATAGAG 

GAGGCCTGCAAGCTGACGCCCCCACATTCGGCCAAATCCAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAGGGCCGTTAACCACATCCGCTCCGTGTGGGAGGACTTGCTGGAAGACACTGA 
AACACCAATTGACACCACCATCATGG 

CAAAAAGTGAGGTTTTCTGCGTCCAACCAGAGAAGGGAGGCCGCAAGCCAGCT 
CGCCTTATCGTATTCCCAGACCTGGGA 

GTTCGTGTATGCGAGAAGATGGCCCTTTACGACGTGGTCTCCACCCTTCCTCAG 
GCCGTGATGGGCTCCTCATACGGATT 

TCAATACTCCCCCAAGCAGCGGGTCGAGTTCCTGGTGAATACCTGGAAATCAAA 
GAAATGCCCTATGGGCTTCTCATATG 

ACACCCGCTGTTTTGACTCAACGGTCACTGAGAGTGACATTCGTGTTGAGGAGT 
CAATTTACCAATGTTGTGACTTGGCC 

CCCGAGGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGGG 
TCCCCTGACTAACTCAAAAGGGCAGAA 

CTGCGGTTATCGCCGGTGCCGCGCAAGTGGCGTGCTGACGACTAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAGGCCA 

CTGCAGCCTGTCGAGCTGCAAAGCTCCAGGACTGCACGATGCTCGTGAACGGAG 
ACGACCTTGTCGTTATCTGTGAAAGC 
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GCGGGAACCCAGGAGGATGCGGCGGCCCTACGAGCCTTCACGGAGGCTATGAC 
TAGGTATTCCGCCCCCCCCGGGGATCC 

GCCCCAACCAGAATACGACCTGGAGCTGATAACATCATGTTCCTCCAATGTGTC 
AGTCGCGCACGATGCATCTGGCAAAA 

GGGTATACTACCTCACCCGTGACCCCACCACCCCCCTTGCACGGGCTGCGTGGG 
AGACAGCTAGACACACTCCAATCAAC 

TCTTGGCTAGGCAATATCATCATGTATGCGCCCACCCTATGGGCAAGGATGATT 
CTGATGACTCACTTTTTCTCCATCCT 

TCTAGCTCAAGAGCAACTTGAAAAAGCCCTGGATTGTCAGATCTACGGGGCTTG 
CTACTCCATTGAGCCACTTGACCTAC 

CTCAGATCATTGAACGACTCCATGGTCTTAGCGCATTTACACTCCACAGTTACTC 
TCCAGGTGAGATCAATAGGGTGGCT 

TCATGCCTCAGGAAACTTGGGGTACCACCCTTGCGAACCTGGAGACATCGGGCC 
AGAAGTGTCCGCGCTAAGCTACTGTC 

CCAGGGGGGGAGGGCCGCCACTTGTGGCAGATACCTCTTTAACTGGGCAGTAA 
GGACCAAGCTTAAACTCACTCCAATCC 

CGGCCGCGTCCCAGCTGGACTTGTCTGGCTGGTTCGTCGCTGGTTACAGCGGGG 
GAGACATATATCACAGCCTGTCTCGT 

GCCCGACCCCGCTGGTTTCCGTTGTGCCTACTCCTACTTTCTGTAGGGGTAGGCA 
TTTACCTGCTCCCCAACCGATGAAC 

GGGGAGCTAACCACTCCAGGCCTTAAGCCATITCCTGri'l'l'i ri l'l 1 1 1 1 1 1 1 1 1 1 
TTTTTTTTTCTTTTTTTTTTTCTI 

TCCTITCCIT CIU - lllirCCl^^ ' Cllll lCCCTTCTITAATGGTGGCTCCATCrTAG 
CCCTAGTCACGGCTAGCTGTGAA 

AGGTCCGTGAGCCGCtTGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGATC 
AaGT 

J4 Replicon: [SEQ ID NO: 1 1 ] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 
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GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTAGACCTGGACCTCGTCGG 
CTGGCAGGCGCCCCCCGGGGCGCGCT 

CCATGACACCATGCAGCTGTGGCAGCTCGGACCTTTACTTGGTCACGAGACATG 
CTGATGTCATTCCGGTGCGCCGGCGA 

GGCGACAGCAGGGGAAGTCTACTCTCCCCCAGGCCCGTCTCCTACCTGAAAGGC 
TCCTCGGGTGGTCCATTGCTTTGCCC 
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TTCGGGGCACGTCGTGGGCGTCTTCCCXjGCTGCTGTGTGCACCCGGGGGGTCGC 
GAAGGCGGTGGACTTCATACCCGTTG 

AGTCTATGGAAACTACCATGCGGTCTCCGGTCTTCACAGACAACTCAACCCCCC 
CGGCTGTACCGCAGACATTCCAAGTG 

GCACATCTGCACGCTCCTACTGGCAGCGGCAAGAGCACCAAAGTGCCGGCTGC 
GTATGCAGCCCAAGGGTACAAGGTGCT 

CGTCCTGAACCCGTCCGTTGCCGCCACCTTAGGGTTTGGGGCGTATATGTCCAA 
GGCACACGGTATCGACCCTAACATCA 

GAACTGGGGTAAGGACCATTACCACGGGCGGCTCCATTACGTACTCCACCTATG 
GCAAGTTCCTTGCCGACGGTGGCTGT 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACTACCATCTTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGGCTCGTCGTGCTCGCCACCGCTA 
CACCTCCGGGATCGGTTACCGTGCCAC 

ACCCCAATATCGAGGAAATAGGCCTGTCCAACAATGGAGAGATCCCCTTCTATG 
GCAAAGCCATCCCCATTGAGGCCATC 

AAGGGGGGGAGGCATCTCAnTTCTGCCATTCCAAGAAGAAATGTGACGAGCTC 
GCCGCAAAGCTGACAGGCCTCGGACT 

GAACGCTGTAGCATATTACCGGGGCCTTGATGTGTCCGTCATACCGCCTATCGG 
AGACGTCGTTGTCGTGGCAACAGACG 

CTCTAATGACGGGTTTCACCGGCGATTTTGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

TTGGATCCCACCTTCACCATTGAGACGACGACCGTGCCCCAAGACGCGGTGTCG 
CGCTCGCAACGGCGAGGTAGAACTGG 

CAGGGGTAGGAGTGGCATCTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCTTCGGTCCTGTGTG 

AGTGCTATGACGCGGGCTGTGCTTGGTATGAGCTCACGCCCGCTGAGACCTCGG 
TTAGGTTGCGGGCTTACCTAAATACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTCACA 
GGCCTCACCCACATAGATGCCCACTT 

CCTGTCCCAGACTAAACAGGCAGGAGACAACTTTCCTTACCTGGTGGCATATCA 
AGCTACAGTGTGCGCCAGGGCTCAAG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTGAAACCTA 
CACTGCACGGGCCAACACCCCTGCTG 

TATAGGCTAGGAGCCGTCCAAAATGAGGTCATCCTCACACACCCCATAACTAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACTAGCACCTGGGTGCTGGTAGGCGGAGTCCTTGCAGCTTT 
GGCCGCATACTGCCTGACGACAGGCA 

GTGTGGTCATTGTGGGCAGGATCATCTTGTCCGGGAAGCCAGCTGTCGTTCCCG 
ACAGGGAAGTCCTCTACCAGGAGTTC 

GATGAGATGGAAGAGTGTGCCTCACAACTTCCTTACATCGAGCAGGGAATGCA 
GCTCGCCGAGCAATTCAAGCAAAAGGC 
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GCTCGGGTTGTTGCAAACGGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAGTCCAAGTGGCGAGCCCTTGAGA 

CCTTCTGGGCGAAGCACATGTGGAATTTCATCAGCGGAATACAGTACCTAGCAG 
GCTTATCCACTCTGCCTGGAAACCCC 

GCGATAGCATCATTGATGGCATTTACAGCTTCTATCACTAGCCCGCTCACCACCC 
AAAACACCCTCCTGTTTAACATCTT 

GGGGGGATGGGTGGCTGCCCAACTCGCTCCTCCCAGCGCTGCGTCAGCTTTCGT 
GGGCGCCGGCATCGCCGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTCGTGGACATCTTGGCGGGCTATGGGG 
CAGGGGTAGCCGGCGCACTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGGTGCCCTCCACCGAGGACCTGGTCAACTTACTC 
CCTGCCATCCTCTCTCCTGGTGCCCT 

GGTCGTCGGGGTCGTGTGCGCAGCAATACTGCGTCGGCACGTGGGCCCGGGAG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCTACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGCCTTACCATCACTCAACTGCTGAAGCGGCTCCACCAGTGG 
ATTAATGAGGACTGCTCTACGCCATG 

CTCCGGCTCGTGGCTAAGGGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
CTTCAAGACCTGGCTCCAGTCCAAAC 

TCCTGCCGCGGTTACCGGGAGTCCCTTTCCTGTCATGCCAACGCGGGTACAAGG 
GAGTCTGGCGGGGGGACGGCATCATG 

CAAACCACCTGCCCATGCGGAGCACAGATCGCCGGACATGTCAAAAACGGTTC 
CATGAGGATCGTAGGGCCTAGAACCTG 

CAGCAACACGTGGCACGGAACGTTCCCCATCAACGCATACACCACGGGACCTT 
GCACACCCTCCCCGGCGCCCAACTATT 

CCAGGGCGCTATGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGTGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCATGCCAGGTTCCGGCCCCCGAATTCTTCACG 
GAGGTGGATGGAGTGCGGTTGCACAG 

GTACGCTCCGGCGTGCAAACCTCTTCTACGGGAGGACGTCACGTTCCAGGTCGG 
GCTCAACCAATACTTGGTCGGGTCGC 

AGCTCCCATGCGAGCCCGAACCGGACGTAACAGTGCTTACTTCCATGCTCACCG 
ATCCCTCCCACATTACAGCAGAGACG 

GCTAAGCGTAGGCTGGCTAGAGGGTCTCCCCCCTCTTTAGCCAGCTCATCAGCT 
AtCCAGTTGTCTGCGCCTTCTTTGAA 

GGCGACATGCACTACCCACCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCTTGTGGCGGCAGGAGATGGGCG 

GAAACATCACTCGCGTGGAGTCAGAGAATAAGGTAGTAATTCTGGACTCTTTCG 
AACCGCTTCACGCGGAGGGGGATGAG 

AGGGAGATATCCGTCGCGGCGGAGATCCTGCGAAAATCCAGGAAGTTCCCCTC 
AGCGTTGCCCATATGGGCACGCCCGGA 
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CTACAATCCTCCACTGCTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCGGT 
GGTACACGGATGCCCATTGCCACCTA 

CCAAGGCTCCTCCAATACCACCTCCACGGAGAAAGAGGACGGTTGTCCTGACAG 
AATCCAATGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACTAAGACCTTCGGTAGCTCCGGATCGTCGGCCGTTGATAGCGGC 
ACGGCGACCGCCCTTCCTGACCTGGC 

CTCCGACGACGGTGACAAAGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAAGGGGAGCCGGGGGACCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTGAGTGAGGAGGCTAGTGAGGATGTCG 
TCTGCTGCTCAATGTCCTATACGTGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAAGTAAGCTGCCCATCAA 
CCCGTTGAGCAACTCTTTGCTGCGTCA 

CCACAACATGGTCTACGCCACAACATCCCGCAGCGCAAGCCTCCGGCAGAAGA 
AGGTCACCTTTGACAGATTGCAAGTCC 

TGGATGATCATTACCGGGACGTACTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAGCTTCTATCTATAGAG 

GAGGCCTGCAAGCTGACGCCCCCACATTCGGCCAAATCCAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAGGGCCGTTAACCACATCCGCTCCGTGTGGGAGGACTTGCTGGAAGACACTGA 
AACACCAATTGACACCACCATCATGG 

CAAAAAGTGAGGTTTTCTGCGTCCAACCAGAGAAGGGAGGCCGCAAGCCAGCT 
CGCCTTATCGTATTCCCAGACCTGGGA 

GTTCGTGTATGCGAGAAGATGGCCCTTTACGACGTGGTCTCCACCCTTCCTCAG 
GCCGTGATGGGCTCCTCATACGGATT 

TCAATACTCCCCCAAGCAGCGGGTCGAGTTCCTGGTGAATACCTGGAAATCAAA 
GAAATGCCCTATGGGCTTCTCATATG 

ACACCCGCTGTTTTGACTCAACGGTCACTGAGAGTGACATTCGTGTTGAGGAGT 
CAATTTACCAATGTTGTGACTTGGCC 

CCCGAGGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGGG 
TCCCCTGACTAACTCAAAAGGGCAGAA 

CTGCGGTTATCGCCGGTGCCGCGCAAGTGGCGTGCTGACGACTAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAGGCCA 

CTGCAGCCTGTCGAGCTGCAAAGCTCCAGGACTGCACGATGCTCGTGAACGGAG 
ACGACCTTGTCGTTATCTGTGAAAGC 

GCGGGAACCCAGGAGGATGCGGCGGCCCTACGAGCCTTCACGGAGGCTATGAC 
TAGGTATTCCGCCCCCCCCGGGGATCC 

GCCCCAACCAGAATACGACCTGGAGCTGATAACATCATGTTCCTCCAATGTGTC 
AGTCGCGCACGATGCATCTGGCAAAA 

GGGTATACTACCTCACCCGTGACCCCACCACCCCCCTTGCACGGGCTGCGTGGG 
AGACAGCTAGACACACTCCAATCAAC 

TCTTGGCTAGGCAATATCATCATGTATGCGCCCACCCTATGGGCAAGGATGATT 
CTGATGACTCACTTTTTCTCCATCCT 
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TCTAGCTCAAGAGCAACTTGAAAAAGCCCTGGATTGTCAGATCTACGGGGCITG 
CTACTCCATTGAGCCACTTGACCTAC 

CTCAGATCATTGAACGACTCCATGGTCTTAGCGCATTTACACTCCACAGTTACTC 
TCCAGGTGAGATCAATAGGGTGGCT 

TCATGCCTCAGGAAACTTGGGGTACCACCCTTGCGAACCTGGAGACATCGGGCC 
AGAAGTGTCCGCGCTAAGCTACTGTC 

CCAGGGGGGGAGGGCCGCCACTTGTGGCAGATACCTCTTTAACTGGGCAGTAA 
GGACCAAGCTTAAACTCACTCCAATCC 

CGGCCGCGTCCCAGCTGGACTTGTCTGGCTGGTTCGTCGCTGGTTACAGCGGGG 
GAGACATATATCACAGCCTGTCTCGT 

GCCCGACCCCGCTGGTTTCCGTTGTGCCTACTCCTACTTTCTGTAGGGGTAGGCA 
TTTACCTGCTCCCCAACCGATGAAC 

GGGGAGCTAACCACTCCAGGCCTTAAGCCATTTCCTGT1-1 ri l l l l l l Ti l 11111 
TTTTTTTTTCTTTTTTTTTTTCTT 
TCCTTTCCrTC lTlTlTl C ClTlClTllT CCC^ 
CCCTAGTCACGGCTAGCTGTGAA 

AGGTCCGTGAGCCGCATGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGAT 
CAaGT 

J4 replicon (c): [SEQ ID NO: 12] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 
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CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTAGACCTGGACCTCGTCGG 
CTGGCAGGCGCCCCCCGGGGCGCGCT 

CCATGACACCATGCAGCTGTGGCAGCTCGGACCTTTACTTGGTCACGAGACATG 
CTGATGTCATTCCGGTGCGCCGGCGA 

GGCGACAGCAGGGGAAGTCTACTCTCCCCCAGGCCCGTCTCCTACCTGAAAGGC 
TCCTCGGGTGGTCCATTGCTTTGCCC 

TTCGGGGCACGTCGTGGGCGTCTTCCGGGCTGCTGTGTGCACCCGGGGGGTCGC 
GAAGGCGGTGGACTTCATACCCGTTG 

AGTCTATGGAAACTACCATGCGGTCTCCGGTCTTCACAGACAACTCAACCCCCC 
CGGCTGTACCGCAGACATTCCAAGTG 

GCACATCTGCACGCTCCTACTGGCAGCGGCAAGAGCACCAAAGTGCCGGCTGC 
GTATGCAGCCCAAGGGTACAAGGTGCT 

CGTCCTGAACCCGTCCGTTGCCGCCACCTTAGGGTTTGGGGCGTATATGTCCAA 
GGCACACGGTATCGACCCTAACATCA 
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GAACTGGGGTAAGGACCATTACCACGGGCGGCTCCATTACGTACTCCACCTATG 
GCAAGTTCCTTGCCGACGGTGGCTGT 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACTACCATCTTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGGCTCGTCGTGCTCGCCACCGCTA 
CACCTCCGGGATCGGTTACCGTGCCAC 

ACCCCAATATCGAGGAAATAGGCCTGTCCAACAATGGAGAGATCCCCTTCTATG 
GCAAAGCCATCCCCATTGAGGCCATC 

AAGGGGGGGAGGCATCTCATTTTCTGCCATTCCAAGAAGAAATGTGACGAGCTC 
GCCGCAAAGCTGACAGGCCTCGGACT 

GAACGCTGTAGCATATTACCGGGGCCTTGATGTGTCCGTCATACCGCCTATCGG 
AGACGTCGTTGTCGTGGCAACAGACG 

CTCTAATGACGGGTTTCACCGGCGATTTTGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

TTGGATCCCACCTTCACCATTGAGACGACGACCGTGCCCCAAGACGCGGTGTCG 
CGCTCGCAACGGCGAGGTAGAACTGG 

CAGGGGTAGGAGTGGCATCTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCTTCGGTCCTGTGTG 

AGTGCTATGACGCGGGCTGTGCTTGGTATGAGCTCACGCCCGCTGAGACCTCGG 
TTAGGTTGCGGGCTTACCTAAATACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTCACA 
GGCCTCACCCACATAGATGCCCACTT 

CCTGTCCCAGACTAAACAGGCAGGAGACAACTTTCCTTACCTGGTGGCATATCA 
AGCTACAGTGTGCGCCAGGGCTCAAG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTGAAACCTA 
CACTGCACGGGCCAACACCCCTGCTG 

TATAGGCTAGGAGCCGTCCAAAATGAGGTCATCCTCACACACCCCATAACTAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACTAGCACCTGGGTGCTGGTAGGCGGAGTCCrrTGCAGCTTT 
GGCCGCATACTGCCTGACGACAGGCA 

GTGTGGTCATTGTGGGCAGGATCATCTTGTCCGGGAAGCCAGCTGTCGTTCCCG 
ACAGGGAAGTCCTCTACCAGGAGTTC 

GATGAGATGGAAGAGTGTGCCTCACAACTTCCTTACATCGAGCAGGGAATGCA 
GCTCGCCGAGCAATTCAAGCAAAAGGC 

GCTCGGGTTGTTGCAAACGGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAGTCCAAGTGGCGAGCCCTTGAGA 

CCTTCTGGGCGAAGCACATGTGGAATTTCATCAGCGGAATACAGTACCTAGCAG 
GCTTATCCACTCTGCCTGGAAACCCC 

GCGATAGCATCATTGATGGCATTTACAGCTTCTATCACTAGCCCGCTCACCACCC 
AAAACACCCTCCTGTTTAACATCTT 

GGGGGGATGGGTGGCTGCCCAACTCGCTCCTCCCAGCGCTGCGTCAGCTTTCGT 
GGGCGCCGGCATCGCCGGAGCGGCTG 
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TTGGCAGCATAGGCCTTGGGAAGGTGCTCGTGGACATCTTGGCGGGCTATGGGG 
CAGGGGTAGCCGGCGCACTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGGTGCCCTCCACCGAGGACCTGGTCAACTTACTC 
CCTGCCATCCTCTCTCCTGGTGCCCT 

GGTCGTCGGGGTCGTGTGCGCAGCAATACTGCGTCGGCACGTGGGCCCGGGAG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCTACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGCCTTACCATCACTCAACTGCTGAAGCGGCTCCACCAGTGG 
ATTAATGAGGACTGCTCTACGCCATG 

CTCCGGCTCGTGGCTAAGGGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
CTTCAAGACCTGGCTCCAGTCCAAAC 

TCCTGCCGCGGTTACCGGGAGTCCCTTTCCTGTCATGCCAACGCGGGTACAAGG 
GAGTCTGGCGGGGGGACGGCATCATG 

CAAACCACCTGCCCATGCGGAGCACAGATCGCCGGACATGTCAAAAACGGTTC 
CATGAGGATCGTAGGGCCTAGAACCTG 

CAGCAACACGTGGCACGGAACGTTCCCCATCAACGCATACACCACGGGACCTT 
GCACACCCTCCCCGGCGCCCAACTATT 

CCAGGGCGCTATGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGTGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCATGCCAGGTTCCGGCCCCCGAATTCTTCACG 
GAGGTGGATGGAGTGCGGTTGCACAG 

GTACGCTCCGGCGTGCAAACCTCTTCTACGGGAGGACGTCACGTTCCAGGTCGG 
GCTCAACCAATACTTGGTCGGGTCGC 

AGCTCCCATGCGAGCCCGAACCGGACGTAACAGTGCTTACTTCCATGCTCACCG 
ATCCCTCCCACATTACAGCAGAGACG 

GCTAAGCGTAGGCTGGCTAGAGGGTCTCCCCCCTCTTTAGCCAGCTCATCAGCT 
AtCCAGTTGTCTGCGCCTTCTTTGAA 

GGCGACATGCACTACCCACCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCTTGTGGCGGCAGGAGATGGGCG 

GAAACATCACTCGCGTGGAGTCAGAGAATAAGGTAGTAATTCTGGACTCTTTCG 
AACCGCTTCACGCGGAGGGGGATGAG 

AGGGAGATATCCGTCGCGGCGGAGATCCTGCGAAAATCCAGGAAGTTCCCCTC 
AGCGTTGCCCATATGGGCACGCCCGGA 

CTACAATCCTCCACTGCTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCGGT 
GGTACACGGATGCCCATTGCCACCTA 

CCAAGGCTCCTCCAATACCACCTCCACGGAGAAAGAGGACGGTTGTCCTGACAG 
AATCCAATGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACTAAGACCTTCGGTAGCTCCGGATCGTCGGCCGTTGATAGCGGC 
ACGGCGACCGCCCTTCCTGACCTGGC 

CTCCGACGACGGTGACAAAGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAAGGGGAGCCGGGGGACCCCG 
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ATCTCAGCGACGGGTCTTGGTCTACCGTGAGTGAGGAGGCTAGTGAGGATGTCG 
TCTGCTGCTCAATGTCCTATACGTGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAAGTAAGCTGCCCATCAA 
CCCGTTGAGCAACTCTTTGCTGCGTCA 

CCACAACATGGTCTACGCCACAACATCCCGCAGCGCAAGCCTCCGGCAGAAGA 
AGGTCACCTTTGACAGATTGCAAGTCC 

TGGATGATCATTACCGGGACGTACTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAGCTTCTATCTATAGAG 

GAGGCCTGCAAGCTGACGCCCCCACATTCGGCCAAATCCAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAGGGCCGTTAACCACATCCGCTCCGTGTGGGAGGACTTGCTGGAAGACACTGA 
AACACCAATTGACACCACCATCATGG 

CAAAAAGTGAGGTTTTCTGCGTCCAACCAGAGAAGGGAGGCCGCAAGCCAGCT 
CGCCTTATCGTATTCCCAGACCTGGGA 

GTTCGTGTATGCGAGAAGATGGCCCTTTACGACGTGGTCTCCACCCTTCCTCAG 
GCCGTGATGGGCTCCTCATACGGATT 

TCAATACTCCCCCAAGCAGCGGGTCGAGTTCCTGGTGAATACCTGGAAATCAAA 
GAAATGCCCTATGGGCTTCTCATATG 

ACACCCGCTGTTTTGACTCAACGGTCACTGAGAGTGACATTCGTGTTGAGGAGT 
CAATTTACCAATGTTGTGACTTGGCC 

CCCGAGGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGGG 
TCCCCTGACTAACTCAAAAGGGCAGAA 

CTGCGGTTATCGCCGGTGCCGCGCAAGTGGCGTGCTGACGACTAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAGGCCA 

CTGCAGCCTGTCGAGCTGCAAAGCTCCAGGACTGCACGATGCTCGTGAACGGAG 
ACGACCTTGTCGTTATCTGTGAAAGC 

GCGGGAACCCAGGAGGATGCGGCGGCCCTACGAGCCTTCACGGAGGCTATGAC 
TAGGTATTCCGCCCCCCCCGGGGATCC 

GCCCCAACCAGAATACGACCTGGAGCTGATAACATCATGTTCCTCCAATGTGTC 
AGTCGCGCACGATGCATCTGGCAAAA 

GGGTATACTACCTCACCCGTGACCCCACCACCCCCCTTGCACGGGCTGCGTGGG 
AGACAGCTAGACACACTCCAATCAAC 

TCTTGGCTAGGCAATATCATCATGTATGCGCCCACCCTATGGGCAAGGATGATT 
CTGATGACTCACTTTTTCTCCATCCT 

TCTAGCTCAAGAGCAACTTGAAAAAGCCCTGGATTGTCAGATCTACGGGGCTTG 
CTACTCCATTGAGCCACTTGACCTAC 

CTCAGATCATTGAACGACTCCATGGTCTTAGCGCATTTACACTCCACAGTTACTC 
TCCAGGTGAGATCAATAGGGTGGCT 

TCATGCCTCAGGAAACTTGGGGTACCACCCTTGCGAACCTGGAGACATCGGGCC 
AGAAGTGTCCGCGCTAAGCTACTGTC 

CCAGGGGGGGAGGGCCGCCACTTGTGGCAGATACCTCTTTAACTGGGCAGTAA 
GGACCAAGCTTAAACTCACTCCAATCC 
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CGGCCGCGTCCCAGCTGGACTTGTCTGGCTGGTrCGTCGCTGGTTACAGCGGGG 
GAGACATATATCACAGCCTGTCTCGT 

GCCCGACCCCGCTGGTTTCCGTTGTGCCTACTCCTACTTTCTGTAGGGGTAGGCA 
TTTACCTGCTCCCCAACCGATGAAC 

GGGGAGCTAACCACTCCAGGCCTTAAGCCATTTCCTGl'l'l'l lTl TTTI 1 1 1 1 1 1 1 1 
TTTTTTTTTCTTTTTTTTTTTCTT 

TCCTTTC CTTCTTTTTI 1 CCITTCTTTTTCCCITCTTTAATGGTGGCTCCATCTTAG 
CCCTAGTCACGGCTAGCTGTGAA 

AGGTCCGTGAGCCGCtTGACTGCAGAGAGTGCTGATACTGGCCTCTCTGCAGATC 
AaGT 

BB7/J4NSSB: [SEQ ID NO: 13] 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCITTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 
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GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTrCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCITGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 

GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCG 
GCTGGCAAGCGCCCCCCGGGGCGCGTT 

CCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATG 
CCGATGTCATTCCGGTGCGCCGGCGG 

GGCGACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGC 
TCTTCGGGCGGTCCACTGCTCTGCCC 

CTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGC 
GAAGGCGGTGGACTTTGTACCCGTCG 

AGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTC 
CGGCCGTACCGCAGACATTCCAGGTG 

GCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCG 
TATGCAGCCCAAGGGTATAAGGTGCT 

TGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAA 
GGCACATGGTATCGACCCTAACATCA 

GAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCACCTATG 
GCAAGTTTCTTGCCGACGGTGGTTGC 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACCACTATCCTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTA 
CGCCTCCGGGATCGGTCACCGTGCCAC 

ATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTTTATG 
GCAAAGCCATCCCCATCGAGACCATC 
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AAGGGGGGGAGGCACCTCATTTTCTGCCATTCCAAGAAGAAATGTGATGAGCTC 
GCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGG 
AGACGTCATTGTCGTAGCAACGGACG 

CTCTAATGACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 

CTGGACCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCA 
CGCTCGCAGCGGCGAGGCAGGACTGG 

TAGGGGCAGGATGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCCTCGGTTCTGTGCG 

AGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAG 
TTAGGTTGCGGGCTTACCTAAACACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACA 
GGCCTCACCCACATAGACGCCCATTT 

CITGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCA 
GGCTACGGTGTGCGCCAGGGCTCAGG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTAAAGCCTA 
CGCTGCACGGGCCAACGCCCCTGCTG 

TATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACCAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCTCT 
GGCCGCGTATTGCCTGACAACAGGCA 

GCGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCG 
ACAGGGAAGTCCTTTACCGGGAGTTC 

GATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGCA 
GCTCGCCGAACAATTCAAACAGAAGGC 

AATCGGGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAATCCAAGTGGCGGACCCTCGAAG 

CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAG 
GCTTGTCCACTCTGCCTGGCAACCCC 

GCGATAGCATCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACC 
CAACATACCCTCCTGTTTAACATCCT 

GGGGGGATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGT 
AGGCGCCGGCATCGCTGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAG 
CAGGGGTGGCAGGCGCGCTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTC 
CCTGCTATCCTCTCCCCTGGCGCCCT 

AGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCCCAGGGG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 
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CAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGTGG 
ATCAACGAGGACTGCTCCACGCCATG 

CTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
TTTCAAGACCTGGCTCCAGTCCAAGC 

TCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTACAAGG 
GAGTCTGGCGGGGCGACGGCATCATG 

CAAACCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTC 
CATGAGGATCGTGGGGCCTAGGACCTG 

TAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCTG 
CACGCCCTCCCCGGCGCCAAATTATT 

CTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACA 
GAAGTGGATGGGGTGCGGTTGCACAG 

GTACGCTCCAGCGTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGG 
GCTCAATCAATACCTGGTTGGGTCAC 

AGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCG 
ACCCCTCCCACATTACGGCGGAGACG 

GCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCT 
ATCCAGCTGTCTGCGCCTTCCTTGAA 

GGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCCTGTGGCGGCAGGAGATGGGCG 

GGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGACTCTTTCG 
AGCCGCTCCAAGCGGAGGAGGATGAG 

AGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGA 
GCGATGCCCATATGGGCACGCCCGGA 

TTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCAGT 
GGTACACGGGTGTCCATTGCCGCCTG 

CCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAG 
AATCTACCGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACAGCGG 
CACGGCAACGGCCTCTCCTGACCAGCC 

CTCCGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAGGGGGAGCCGGGGGATCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGTCG 
TCTGCTGCTCGATGTCCTACACATGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAAGTAAGCTGCCCATCAA 
CCCGTTGAGCAACTCTTTGCTGCGTCA 

CCACAACATGGTCTACGCCACAACATCCCGCAGCGCAAGCCTCCGGCAGAAGA 
AGGTCACCTTTGACAGATTGCAAGTCC 

TGGATGATCATTACCGGGACGTACTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAGCTTCTATCTATAGAG 



83 



WO 03/085084 




PCT7US03/10177 



GAGGCCTGCAAGCTGACGCCCCCACATTCGGCCAAATCCAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAGGGCCGTTAACCACATCCGCTCCGTGTGGGAGGACTTGCTGGAAGACACTGA 
AACACCAATTGACACCACCATCATGG 

CAAAAAGTGAGGTTTTCTGCGTCCAACCAGAGAAGGGAGGCCGCAAGCCAGCT 
CGCCTTATCGTATTCCCAGACCTGGGA 

GTTCGTGTATGCGAGAAGATGGCCCTTTACGACGTGGTCTCCACCCTTCCTCAG 
GCCGTGATGGGCTCCTCATACGGATT 

TCAATACTCCCCCAAGCAGCGGGTCGAGTTCCTGGTGAATACCTGGAAATCAAA 
GAAATGCCCTATGGGCTTCTCATATG 

ACACCCGCTGTTTTGACTCAACGGTCACTGAGAGTGACATTCGTGTTGAGGAGT 
CAATTTACCAATGTTGTGACTTGGCC 

CCCGAGGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGGG 
TCCCCTGACTAACTCAAAAGGGCAGAA 

CTGCGGTTATCGCCGGTGCCGCGCAAGTGGCGTGCTGACGACTAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAGGCCA 

CTGCAGCCTGTCGAGCTGCAAAGCTCCAGGACTGCACGATGCTCGTGAACGGAG 
ACGACCTTGTCGTTATCTGTGAAAGC 

GCGGGAACCCAGGAGGATGCGGCGGCCCTACGAGCCTTCACGGAGGCTATGAC 
TAGGTATTCCGCCCCCCCCGGGGATCC 

GCCCCAACCAGAATACGACCTGGAGCTGATAACATCATGTTCCTCCAATGTGTC 
AGTCGCGCACGATGCATCTGGCAAAA 

GGGTATACTACCTCACCCGTGACCCCACCACCCCCCTTGCACGGGCTGCGTGGG 
AGACAGCTAGACACACTCCAATCAAC 

TCTTGGCTAGGCAATATCATCATGTATGCGCCCACCCTATGGGCAAGGATGATT 
CTGATGACTCACTTTTTCTCCATCCT 

TCTAGCTCAAGAGCAACTTGAAAAAGCCCTGGATTGTCAGATCTACGGGGCTTG 
CTACTCCATTGAGCCACTTGACCTAC 

CTCAGATCATTGAACGACTCCATGGTCTTAGCGCATTTACACTCCACAGTTACTC 
TCCAGGTGAGATCAATAGGGTGGCT 

TCATGCCTCAGGAAACTTGGGGTACCACCCTTGCGAACCTGGAGACATCGGGCC 
AGAAGTGTCCGCGCTAAGCTACTGTC 

CCAGGGGGGGAGGGCCGCCACTTGTGGCAGATACCTCTTTAACTGGGCAGTAA 
GGACCAAGCTTAAACTCACTCCAATCC 

CGGCCGCGTCCCAGCTGGACTTGTCTGGCTGGTTCGTCGCTGGTTACAGCGGGG 
GAGACATATATCACAGCCTGTCTCGT 

GCCCGACCCCGCTGGTTTCCGTTGTGCCTACTCCTACTTTCTGTAGGGGTAGGCA 
TTTACCTGCTCCCCAACCGATGAAC 

GGGtAcgTAAACACTCCAGGCCAATAGGCCATCCTG'rri-l-l'nCCCl i l l inn M 

CTTTTTTTTTTTTTTTTI 11111 

TTTTTTTTTTTTC TC CTTTTTTTT^ 

CCATCTTAGCCCTAGTCACGGC 
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TAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCT 
CTCTGCAGATCAAGT 

pBB7-SN: [SEQ ID NO: 14] 
BB7-SN: 

GCCAGCCCCCGATTGGGGGCGACACTCCACCATAGATCACTCCCCTGTGAGGAA 
CTACTGTCTTCACGCAGAAAGCGTCT 

AGCCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGA 
GAGCCATAGTGGTCTGCGGAACCGGT 

GAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGATCAACCCGCTCA 
ATGCCTGGAGATTTGGGCGTGCCCCC 

GCGAGACTGCTAGCCGAGTAGTGTTGGGTCGCGAAAGGCCTTGTGGTACTGCCT 
GATAGGGTGCTTGCGAGTGCCCCGGG 

AGGTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAGAAAAAC 
CAAAGGGCGCGCCATGATTGAACAAGA 

TGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGA 
CTGGGCACAACAGACAATCGGCTGCT 

CTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAA 
GACCGACCTGTCCGGTGCCCTGAAT 

GAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC 
TTGCGCAGCTGTGCTCGACGTTGTCAC 

TGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCC 
TGTCATCTCACCTTGCTCCTGCCGAGA 

AAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTA 
CCTGCCCATTCGACCACCAAGCGAAA 

CATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGAT 
GATCTGGACGAAGAGCATCAGGGGCT 

CGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGG 
ATCTCGTCGTGACCCATGGCGATGCCT 

GCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTG 
GCCGGCTGGGTGTGGCGGACCGCTAT 

CAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGG 
GCTGACCGCTTCCTCGTGCTTTACGG 

TATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACGAGTTC 
TTCTGAGTTTAAACAGACCACAACG 

GTTTCCCTCTAGCGGGATCAATTCCGCCCCTCTCCCTCCCCCCCCCCTAACGTTA 
CTGGCCGAAGCCGCTTGGAATAAGG 

CCGGTGTGCGTTTGTCTATATGTTATTTTCCACCATATTGCCGTCTTTTGGCAATG 
TGAGGGCCCGGAAACCTGGCCCTG 

TCTTCTTGACGAGCATTCCTAGGGGTCTTTCCCCTCTCGCCAAAGGAATGCAAG 
GTCTGTTGAATGTCGTGAAGGAAGCA 
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GTTCCTCTGGAAGCTTCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGG 
CAGCGGAACCCCCCACCTGGCGACAG 

GTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGCAC 
AACCCCAGTGCCACGTTGTGAGTTGGA 

TAGTTGTGGAAAGAGTCAAATGGCTCTCCTCAAGCGTATTCAACAAGGGGCTGA 
AGGATGCCCAGAAGGTACCCCATTGT 

ATGGGATCTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGG 
TTAAAAAACGTCTAGGCCCCCCGAAC 

CACGGGGACGTGGTTTTCCTTTGAAAAACACGATAATACCATGGCGCCTATTAC 
GGCCTACTCCCAACAGACGCGAGGCC 

TACTTGGCTGCATCATCACTAGCCTCACAGGCCGGGACAGGAACCAGGTCGAGG 
GGGAGGTCCAAGTGGTCTCCACCGCA 

ACACAATCTTTCCTGGCGACCTGCGTCAATGGCGTGTGTTGGACTGTCTATCATG 
GTGCCGGCTCAAAGACCCTTGCCGG 

CCCAAAGGGCCCAATCACCCAAATGTACACCAATGTGGACCAGGACCTCGTCG 
GCTGGCAAGCGCCCCCCGGGGCGCGTT 

CCTTGACACCATGCACCTGCGGCAGCTCGGACCTTTACTTGGTCACGAGGCATG 
CCGATGTCATTCCGGTGCGCCGGCGG 

GGCGACAGCAGGGGGAGCCTACTCTCCCCCAGGCCCGTCTCCTACTTGAAGGGC 
TCTTCGGGCGGTCCACTGCTCTGCCC 

CTCGGGGCACGCTGTGGGCATCTTTCGGGCTGCCGTGTGCACCCGAGGGGTTGC 
GAAGGCGGTGGACTTTGTACCCGTCG 

AGTCTATGGAAACCACTATGCGGTCCCCGGTCTTCACGGACAACTCGTCCCCTC 
CGGCCGTACCGCAGACATTCCAGGTG 

GCCCATCTACACGCCCCTACTGGTAGCGGCAAGAGCACTAAGGTGCCGGCTGCG 
TATGCAGCCCAAGGGTATAAGGTGCT 

TGTCCTGAACCCGTCCGTCGCCGCCACCCTAGGTTTCGGGGCGTATATGTCTAA 
GGCACATGGTATCGACCCTAACATCA 

GAACCGGGGTAAGGACCATCACCACGGGTGCCCCCATCACGTACTCCACCTATG 
GCAAGTTTCTTGCCGACGGTGGTTGC 

TCTGGGGGCGCCTATGACATCATAATATGTGATGAGTGCCACTCAACTGACTCG 
ACCACTATCCTGGGCATCGGCACAGT 

CCTGGACCAAGCGGAGACGGCTGGAGCGCGACTCGTCGTGCTCGCCACCGCTA 
CGCCTCCGGGATCGGTCACCGTGCCAC 

ATCCAAACATCGAGGAGGTGGCTCTGTCCAGCACTGGAGAAATCCCCTTTTATG 
GCAAAGCCATCCCCATCGAGACCATC 

AAGGGGGGGAGGCACCTCAnTTCTGCCATTCCAAGAAGAAATGTGATGAGCTC 
GCCGCGAAGCTGTCCGGCCTCGGACT 

CAATGCTGTAGCATATTACCGGGGCCTTGATGTATCCGTCATACCAACTAGCGG 
AGACGTCATTGTCGTAGCAACGGACG 

CTCTAATGACGGGCTTTACCGGCGATTTCGACTCAGTGATCGACTGCAATACAT 
GTGTCACCCAGACAGTCGACTTCAGC 
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CTGGACCCGACCTTCACCATTGAGACGACGACCGTGCCACAAGACGCGGTGTCA 
CGCTCGCAGCGGCGAGGCAGGACTGG 

TAGGGGCAGGATGGGCATTTACAGGTTTGTGACTCCAGGAGAACGGCCCTCGG 
GCATGTTCGATTCCTCGGTTCTGTGCG 

AGTGCTATGACGCGGGCTGTGCTTGGTACGAGCTCACGCCCGCCGAGACCTCAG 
TTAGGTTGCGGGCTTACCTAAACACA 

CCAGGGTTGCCCGTCTGCCAGGACCATCTGGAGTTCTGGGAGAGCGTCTTTACA 
GGCCTCACCCACATAGACGCCCATTT 

CTTGTCCCAGACTAAGCAGGCAGGAGACAACTTCCCCTACCTGGTAGCATACCA 
GGCTACGGTGTGCGCCAGGGCTCAGG 

CTCCACCTCCATCGTGGGACCAAATGTGGAAGTGTCTCATACGGCTAAAGCCTA 
CGCTGCACGGGCCAACGCCCCTGCTG 

TATAGGCTGGGAGCCGTTCAAAACGAGGTTACTACCACACACCCCATAACCAAA 
TACATCATGGCATGCATGTCGGCTGA 

CCTGGAGGTCGTCACGAGCACCTGGGTGCTGGTAGGCGGAGTCCTAGCAGCTCT 
GGCCGCGTATTGCCTGACAACAGGCA 

GCGTGGTCATTGTGGGCAGGATCATCTTGTCCGGAAAGCCGGCCATCATTCCCG 
ACAGGGAAGTCCTTTACCGGGAGTTC 

GATGAGATGGAAGAGTGCGCCTCACACCTCCCTTACATCGAACAGGGAATGCA 
GCTCGCCGAACAATTCAAACAGAAGGC 

AATCGGGTTGCTGCAAACAGCCACCAAGCAAGCGGAGGCTGCTGCTCCCGTGGT 
GGAATCCAAGTGGCGGACCCTCGAAG 

CCTTCTGGGCGAAGCATATGTGGAATTTCATCAGCGGGATACAATATTTAGCAG 
GCTTGTCCACTCTGCCTGGCAACCCC 

GCGATAGCATCACTGATGGCATTCACAGCCTCTATCACCAGCCCGCTCACCACC 
CAACATACCCTCCTGTTTAACATCCT 

GGGGGGATGGGTGGCCGCCCAACTTGCTCCTCCCAGCGCTGCTTCTGCTTTCGT 
AGGCGCCGGCATCGCTGGAGCGGCTG 

TTGGCAGCATAGGCCTTGGGAAGGTGCTTGTGGATATTTTGGCAGGTTATGGAG 
CAGGGGTGGCAGGCGCGCTCGTGGCC 

TTTAAGGTCATGAGCGGCGAGATGCCCTCCACCGAGGACCTGGTTAACCTACTC 
CCTGCTATCCTCTCCCCTGGCGCCCT 

AGTCGTCGGGGTCGTGTGCGCAGCGATACTGCGTCGGCACGTGGGCCCAGGGG 
AGGGGGCTGTGCAGTGGATGAACCGGC 

TGATAGCGTTCGCTTCGCGGGGTAACCACGTCTCCCCCACGCACTATGTGCCTG 
AGAGCGACGCTGCAGCACGTGTCACT 

CAGATCCTCTCTAGTCTTACCATCACTCAGCTGCTGAAGAGGCTTCACCAGTGG 
ATCAACGAGGACTGCTCCACGCCATG 

CTCCGGCTCGTGGCTAAGAGATGTTTGGGATTGGATATGCACGGTGTTGACTGA 
TTTCAAGACCTGGCTCCAGTCCAAGC 

TCCTGCCGCGATTGCCGGGAGTCCCCTTCTTCTCATGTCAACGTGGGTACAAGG 
GAGTCTGGCGGGGCGACGGCATCATG 
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CAAACCACCTGCCCATGTGGAGCACAGATCACCGGACATGTGAAAAACGGTTC 
CATGAGGATCGTGGGGCCTAGGACCTG 

TAGTAACACGTGGCATGGAACATTCCCCATTAACGCGTACACCACGGGCCCCTG 
CACGCCCTCCCCGGCGCCAAATTATT 

CTAGGGCGCTGTGGCGGGTGGCTGCTGAGGAGTACGTGGAGGTTACGCGGGTG 
GGGGATTTCCACTACGTGACGGGCATG 

ACCACTGACAACGTAAAGTGCCCGTGTCAGGTTCCGGCCCCCGAATTCTTCACA 
GAAGTGGATGGGGTGCGGTTGCACAG 

GTACGCTCCAGCGTGCAAACCCCTCCTACGGGAGGAGGTCACATTCCTGGTCGG 
GCTCAATCAATACCTGGTTGGGTCAC 

AGCTCCCATGCGAGCCCGAACCGGACGTAGCAGTGCTCACTTCCATGCTCACCG 
ACCCCTCCCACATTACGGCGGAGACG 

GCTAAGCGTAGGCTGGCCAGGGGATCTCCCCCCTCCTTGGCCAGCTCATCAGCT 
ATCCAGCTGTCTGCGCCTTCCTTGAA 

GGCAACATGCACTACCCGTCATGACTCCCCGGACGCTGACCTCATCGAGGCCAA 
CCTCCTGTGGCGGCAGGAGATGGGCG 

GGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGACTCTTTCG 
AGCCGCTCCAAGCGGAGGAGGATGAG 

AGGGAAGTATCCGTTCCGGCGGAGATCCTGCGGAGGTCCAGGAAATTCCCTCGA 
GCGATGCCCATATGGGCACGCCCGGA 

TTACAACCCTCCACTGTTAGAGTCCTGGAAGGACCCGGACTACGTCCCTCCAGT 
GGTACACGGGTGTCCATTGCCGCCTG 

CCAAGGCCCCTCCGATACCACCTCCACGGAGGAAGAGGACGGTTGTCCTGTCAG 
AATCTACCGTGTCTTCTGCCTTGGCG 

GAGCTCGCCACAAAGACCTTCGGCAGCTCCGAATCGTCGGCCGTCGACAGCGG 
CACGGCAACGGCCTCTCCTGACCAGCC 

CTCCGACGACGGCGACGCGGGATCCGACGTTGAGTCGTACTCCTCCATGCCCCC 
CCTTGAGGGGGAGCCGGGGGATCCCG 

ATCTCAGCGACGGGTCTTGGTCTACCGTAAGCGAGGAGGCTAGTGAGGACGTCG 
TCTGCTGCTCGATGTCCTACACATGG 

ACAGGCGCCCTGATCACGCCATGCGCTGCGGAGGAAACCAAGCTGCCCATCAA 
TGCACTGAGCAACTCTTTGCTCCGTCA 

CCACAACTTGGTCTATGCTACAACATCTCGCAGCGCAAGCCTGCGGCAGAAGAA 
GGTCACCTTTGACAGACTGCAGGTCC 

TGGACGACCACTACCGGGACGTGCTCAAGGAGATGAAGGCGAAGGCGTCCACA 
GTTAAGGCTAAACTTCTATCCGTGGAG 

GAAGCCTGTAAGCTGACGCCCCCACATTCGGCCAGATCTAAATTTGGCTATGGG 
GCAAAGGACGTCCGGAACCTATCCAG 

CAAGGCCGTTAACCACATCCGCTCCGTGTGGAAGGACTTGCTGGAAGACACTGA 
GACACCAATTGACACCACCATCATGG 

CAAAAAATGAGGTTTTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCT 
CGCCTTATCGTATTCCCAGATTTGGGG 
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GTTCGTGTGTGCGAGAAAATGGCCCTTTACGATGTGGTCTCCACCCTCCCTCAG 
GCCGTGATGGGCTCTTCATACGGATT 

CCAATACTCTCCTGGACAGCGGGTCGAGTTCCTGGTGAATGCCTGGAAAGCGAA 
GAAATGCCCTATGGGCTTCGCATATG 

ACACCCGCTGTTTTGACTCAACGGTCACTGAGAATGACATCCGTGTTGAGGAGT 
CAATCTACCAATGTTGTGACTTGGCC 

CCCGAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGGCTTTACATCGGGGG 
CCCCCTGACTAATTCTAAAGGGCAGAA 

CTGCGGCTATCGCCGGTGCCGCGCGAGCGGTGTACTGACGACCAGCTGCGGTAA 
TACCCTCACATGTTACTTGAAGGCCG 

CTGCGGCCTGTCGAGCTGCGAAGCTCCAGGACTGCACGATGCTCGTATGCGGAG 
ACGACCTTGTCGTTATCTGTGAAAGC 

GCGGGgACCCAAGAGGACGAGGCGAGCCTACGGGCCTTCACGGAGGCTATGAC 
TAGATACTCTGCCCCCCCTGGGGACCC 

GCCCAAACCAGAATACGACTTGGAGTTGATAACATCATGCTCCTCCAATGTGTC 
AGTCGCGCACGATGCATCTGGCAAAA 

GGGTGTACTATCTCACCCGTGACCCCACCACCCCCCTTGCGCGGGCTGCGTGGG 
AGACAGCTAGACACACTCCAGTCAAT 

TCCTGGCTAGGCAACATCATCATGTATGCGCCCACCTTGTGGGCAAGGATGATC 
CTGATGACTCATTTCTTCTCCATCCT 

TCTAGCTCAGGAACAACTTGAAAAAGCCCTAGATTGTCAGATCTACGGGGCCTG 
TTACTCCATTGAGCCACTTGACCTAC 

CTCAGATCATTCAACGACTCCATGGCCTTAGCGCATTTTCACTCCATAGTTACTC 
TCCAGGTGAGATCAATAGGGTGGCT 

TCATGCCTCAGGAAACTTGGGGTACCGCCCTTGCGAGTCTGGAGACATCGGGCC 
AGAAGTGTCCGCGCTAGGCTACTGTC 

CCAGGGGGGGAGGGCTGCCACTTGTGGCAAGTACCTCTTCAACTGGGCAGTAA 
GGACCAAGCTCAAACTCACTCCAATCC 

CGGCTGCGTCCCAGTTGGATTTATCCAGCTGGTTCGTTGCTGGTTACAGCGGGG 
GAGACATATATCACAGCCTGTCTCGT 

GCCCGACCCCGCTGGTTCATGTGGTGCCTACTCCTACTTTCTGTAGGGGTAGGCA 
TCTATCTACTCCCCAACCGATGAAC 

GGGtACgTAAACACTCCAGGCCAATAGGCCATCCTGTTTTTTTCCCTTTTTTTTTT 

TCTTTTTTTTTTTTI 111111111 

TTTTTTTITITT CTC CriTITTTTTC CT 

CCATCTTAGCCCTAGTCACGGC 

TAGCTGTGAAAGGTCCGTGAGCCGCTTGACTGCAGAGAGTGCTGATACTGGCCT 
CTCTGCAGATCAAGT 

(C) Primers 

DNA primer sequences used in making the chimeric replicon constructs. Primers of the 
invention were derived from plasmids also described in the disclosure. 
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BG1000: CAT CCA GAT GTACA C CAA TGT GGA C [SEQ ID NO:31] 

BG1001 : CAT CGC CCG AAT TCT TCA CAG AAT TG [SEQ ID NO:32] 

BG1002: CAA TTC TGT GAA GAA TTC GGG CGA TG [SEQ ID NO:33] 

BG1003: GTA ACA CCA ATT GAC ACT ACC ATC [SEQ ID NO:34] 

BG1004: GAT GGT AGT GTC TAT TGG TGT TAC [SEQ ID NO:35] 

RB8000: GCA CTA GTA CTT GAT CTG CAG AGA GGC CAG TAT CAG CAC TCT 

CTG CAG TCA AGC GG [SEQ ID NO:36] 

J4-9841T: CTT TAG CCA GCT CAT CAG CTA TCC AGT TGT CTG CGC 
CTTC[SEQIDNO:37] 

J49841TR: GAA GGC GCA GAC AAC TGG ATA GCT GAT GAG CTG GCT 
AAA C[SEQ ID NO:38] 

laS9484I R: GAG ATG GAG CGG ACA GCT GGA TAG CCG AGG AGC TGG CCA 
TAG AAG [SEQ ID NO:39] 

1 aS9484I: CTT CTA TGG CCA GCT CCT CGG CTA TCC AGC TGT CCG CTC CAT 
CTC [SEQ ID NO:40] 

rb6000: CGT CTG CTG CTC GAT GTC CTA C [SEQ ID NO:41] 

RB7801 3' CTC CCC CAA CCG ATG AAC GGG TAC GTA AAC ACT CCA GGC CAA 
TAG [SEQ ID NO:42] 

1A 10501 CCT GGA CAG GCG CAC TGA TCA CC [SEQ ID NO:43] 

J4-10861 GAG GAC TTG CTG GAA GAC ACT G [SEQ ID NO:44] 

BB7980 CAG GAG TAC TTG ATC TGC AGA GAG GC [SEQ ID NO:45] 

BG1005: GCA CTA GTA CTT GAT CTG CAG AGA GGC [SEQ ID NO:46] 

RB7801 5' CTA TTG GCC TGG AGT GTT TAC GTA CCC GTT CAT CGG TTG GGG 

GAG [SEQ ID NO:47] 

Ralf Neo 5' TCA AGA CCG ACC TGT CCG GTG CCC [SEQ ID NO:48] 
Ralf Neo 3' CTT GAG CCT GGC GAA CAG TTC GGC [SEQ ID NO:49 
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GAPDH for ACC ACA GTC CAT GCC ATC AC [SEQ ID NO:50] 

GAPDH rev TCC ACC ACC CTG TTG CTG TA [SEQ ID NO:51] 

Neo R fwd CCG GCT ACC TGC CCA TTC [SEQ ID NO:52] 

Neo R rev CCA GAT CAT CCG ATC GAC AAG [SEQ ID NO:53] 

5'FAM-ACA TCG CAT CGA GCG AGC ACG TAC-TAMRA 3' [SEQ ID NO:54] 
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